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PREFACE 


In writing this book I have attempted to organize a text on optics which 
is specifically directed toward electrical engineers. As the reader may 
know, several excellent books devoted to the application of Fourier analy- 
sis and linear systems concepts in optics already exist. However, it has 
been my experience that these previous books are best suited for physi- 
cists, who are already rather familiar with the principles of classical 
optics, but perhaps less familiar with the mathematical techniques which 
are now so fruitfully applied in the modern theory of image formation. 
Electrical engineers, on the other hand, are very familiar with the mathe- 
matical techniques through their extensive exposure to network analysis, 
but are relatively weak in the principles of classical optics. It therefore 
seemed reasonable to present the principles of optics to electrical engineers 
in a manner which makes maximum use of the mathematical tools already 
at their disposal. Thus Fourier analysis and linear systems theory provide 
the foundation on which the theory of image formation, optical data 
processing, and holography are constructed. 

The book originated as a set of class notes for a one-quarter course 
on Fourier Optics in the Department of Electrical Engineering at Stanford 
University. The students were, for the most part, in their first or second 
year of graduate study. As the volume of material grew with successive 
revisions of the notes, it became more and more difficult to cover all the 
material in the 30 lectures of a single quarter. Consequently, in later 
versions of the course I have found it necessary to omit (or at best treat 
only briefly) the following sections: 2-3, 3-5, 3-6, 4-3, 6-6, 7-7, and portions 
of 8-8 and 8-9. For a course running a full semester, all the material 
can probably be included. 

I am grateful to many people for their help and encouragement in 
this endeavor. Perhaps my deepest debt is to the members of the Radar 
and Optics Laboratory staff at the University of Michigan, who through 
their publications and through personal contact stimulated my interest 
in optics and greatly influenced my point of view. More specific debts are 
owed to A. E. Seigman and N. Abramson, who provided early encourage- 
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ment at the time it was most needed; E. L. O’Neill, E. N. Leith, L. J. 
Cutrona, and H. Lotsch, who carefully reviewed the manuscript and 
suggested a number of significant improvements; M. Lehmann, who spent 
much of his own time supplying photographs which are a vital part of 
the text; D. W. Jackson and W. H. Huntley, Jr., who provided many 
stimulating discussions; Mrs. Alice Lescalleet, who typed most of the 
manuscript; and the many students who contributed to the education 
of their teacher. Finally, I would like to thank my wife, Hon Mai, without 
whose patience and encouragement this book would not have become a 
reality. 


Joseph W. Goodman 


To my Mother and Father 
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] 7 INTRODUCTION 


1-1 OPTICS AND COMMUNICATION THEORY 


Since the late 1930s the venerable branch of physics known as optics has 
gradually developed ever-stronger ties with the communication and 
information sciences of electrical engineering. This trend is an understand- 
able one, for both communication systems and imaging systems are 
designed to collect or convey information. In the former case the informa- 
tion is generally of a temporal nature (i.e., a modulated voltage or current 
waveform), while in the latter case it is of a spatial nature (i.e., a light 
amplitude or intensity distribution over space), but from an abstract 
point of view this difference is a rather superficial one. 

Perhaps the strongest tie between the two disciplines lies in the 
similar mathematics which can be used to describe the respective systems 
of interest—the mathematics of Fourier analysis and “systems” theory. 
The fundamental reason for the similar mathematics is not merely the 
common interest in ‘information,’ but rather certain basic properties 
which communication systems and imaging systems share. For example, 
many electronic networks and imaging devices share the properties called 
linearity and invariance (for definitions see Chap. 2). Any network or 
device (electronic, optical, or otherwise) which possesses these two 
properties can be described mathematically with considerable ease using 
the techniques of frequency analysis. Thus, just as it is convenient to 
describe an audio amplifier in terms of its (temporal) frequency response, 
so likewise it is often convenient to describe an imaging system in terms 
of its (spatial) frequency response. 

The similarities do not end when the linearity and invariance prop- 
erties are absent. Certain nonlinear optical elements (in particular 
photographic film) have input-output relationships which are directly 
analogous to the corresponding characteristics of nonlinear electronic 
components (diodes, vacuum tubes, etc.), and similar mathematics of 
analysis can be applied in both cases. 
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It is particularly important to recognize that the similarity of the 
mathematical structures can be exploited not only for analysis purposes, 
but also for synthesis purposes. Thus, just as the spectrum of a temporal 
function can be intentionally manipulated in a prescribed fashion, so 
too can the spectrum of a spatial function be modified in various desired 
ways. The recent history of optics is rich with examples of important 
advances achieved by the application of Fourier synthesis techniques— 
the Zernike phase-contrast microscope, optical matched filters, and 
various contrast enhancement techniques, to mention only a few. The 
future will undoubtedly see many more benefits of the marriage of the 
two disciplines, benefits which will enrich both optics and the communi- 
cation sciences. 


1-2. THE BOOK 


The readers of this book are assumed to have at the start a rather solid 
foundation in Fourier analysis and linear systems theory. Chapter 2 
reviews the required background; to avoid boring those who are well 
grounded in the analysis of temporal signals and communication systems, 
the review is conducted for functions of two independent variables. 
Such functions are, of course, of primary concern in optics, and the exten- 
sion from one to two independent variables provides a new richness to the 
mathematical theory, introducing many new properties which have no 
direct counterpart in the theory of electronic signals and systems. 

The physical phenomenon called dzffraction is of the utmost impor- 
tance in the theory of optical imaging systems. Chapter 3 treats the 
foundations of scalar diffraction theory, including the Kirchhoff, Rayleigh- 
Sommerfeld, and ‘angular spectrum’ approaches. In Chap. 4 certain 
approximations to the general results are introduced, namely, the Fresnel 
and Fraunhofer approximations, and examples of diffraction-pattern 
calculations are presented. 

Lenses are, of course, important elements in imaging and optical 
data-processing systems. Chapter 5 considers lenses and their properties. 
A thin lens is modeled as a phase transformation; the usual lens law is 
derived from this model, as are also certain Fourier transforming prop- 
erties of positive lenses. 

Chapter 6 considers the application of frequency analysis techniques 
to both coherent and incoherent imaging systems. Appropriate transfer 
functions are defined and their properties discussed for systems with and 
without aberrations. Coherent and incoherent systems are compared 
from various points of view. Finally, the possibilities of resolving beyond 
the “‘classical’”’ diffraction limit are considered. 
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Attention is turned to synthesis problems in Chap. 7, which considers 
spatial filtering and optical information processing. The historical 
development of Fourier synthesis ideas is first discussed, followed by a 
brief treatment of the basic properties of photographic film. Optical 
processing systems based on geometrical optics are examined, following 
which attention is turned to the more powerful frequency-plane processing 
techniques. Considerable discussion is devoted to the so-called ‘“‘Vander 
Lugt filter,” which allows synthesis of complex-valued transfer functions 
by means of purely attenuating frequency-plane masks. The primary 
applications considered are optical character recognition and optical 
processing of synthetic-aperture-antenna data. 

The final chapter is devoted to the subject of wavefront-reconstruc- 
tion imaging or holography. Wavefront reconstruction is first discussed 
from a very general point of view, following which the particular tech- 
niques developed by Gabor and by Leith and Upatnieks are considered 
in some detail. The extension to three-dimensional lensless photography 
is also discussed. The properties of photographic film and how they affect 
wavefront-reconstruction images are of particular concern, with attention 
devoted to the effects of limited spatial-frequency response, of non- 
linearities, and of emulsion thickness. Generalizations to holography of 
moving objects, holography with incoherent light, and reflection holograms 
are also treated. Finally, the promising applications of holography, 
ranging from microscopy to imaging through aberrating media, are 
outlined. 


? ANALYSIS OF TWO-DIMENSIONAL 
LINEAR SYSTEMS 


Many physical phenomena are found experimentally to share the basic 
property that their response to several stimuli acting simultaneously is 
identically equal to the sum of the responses that each of the component 
stimuli would produce individually. Such phenomena are called linear, 
and the property that they share is called linearity. Electrical networks 
composed of resistors, capacitors, and inductors are usually linear over a 
wide range of inputs. In addition, as we shall soon see, the linearity of 
the wave equation describing the propagation of light through most 
media leads us naturally to regard optical imaging operations as linear 
mappings of ‘‘object’’ light distributions into ‘‘image”’ light distributions. 

The single property of linearity leads to a vast simplification in the 
mathematical description of such phenomena and represents the founda- 
tion of a mathematical structure which we shall refer to here as linear 
systems theory. The great advantage afforded by linearity is the ability to 
express the response (be it voltage, current, light amplitude, or light 
intensity) to a complicated stimulus in terms of the responses to certain 
“elementary” stimuli. Thus if a stimulus is decomposed into a linear 
combination of elementary stimuli, each of which produces a known 
response of convenient form, then by virtue of linearity the total response 
can be found as a corresponding linear combination of the responses to 
the elementary stimuli. 

In this chapter we review some of the mathematical tools that are 
useful in describing linear phenomena, and discuss some of the mathe- 
matical decompositions that are often employed in their analysis. 
Throughout the later chapters we shall be concerned with stimuli (system 
inputs) and responses (system outputs) that may be either of two different 
physical quantities. If the illumination used in an optical system exhibits 
a property called spatial coherence, then we shall find that it is appropriate 
to describe the light as a spatial distribution of complex-valued field 
amplitude. When the illumination lacks spatial coherence, it is appropriate 
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to describe the light as a spatial distribution of real-valued intensity. 
Attention will be focused here on the analysis of linear systems with 
complex-valued inputs; the results for real-valued inputs are thus included 
as special cases of the theory. 


2-1 FOURIER ANALYSIS IN TWO DIMENSIONS 


A mathematical tool of great utility in the analysis of both linear and 
nonlinear phenomena is Fourier analysis. This tool is widely used in the 
study of electrical networks and communication systems; it is assumed 
that the reader has encountered Fourier theory in such applications and 
therefore that he is familiar with the analysis of functions of one independ- 
ent variable (e.g., time). For a review of the fundamental mathematical 
concepts, see the books by Papoulis [Ref. 2-1] and Bracewell [Ref. 2-2]. 
Our purpose here is limited to extending the reader’s familiarity to the 
analysis of functions of two independent variables. No attempt at great 
mathematical rigor will be made, but rather an operational approach, 
characteristic of most engineering treatments of the subject, will be 
adopted. 


Definition and existence conditions 


The Fourier transform (alternatively the Fourier spectrum or frequency 
spectrum) of a complex function! g of two independent variables, x and 
y, will be represented here by ¥{g} and is defined by? 


Fig} = ans g(x,y) exp [—j2a(fxx + fry)] dx dy (2-1) 


The transform so defined is itself a complex-valued function of two 
independent variables fx and fy, which we generally refer to as frequencies. 
Similarly, the inverse Fourier transform of a function G(fx,fy) will be 
represented by $~!{G} and is defined as 


5G} = ff GUfesr) exp li2n(Ixx + fry] Ux dfy (2-2) 


Note that as mathematical operations the transform and inverse trans- 


1 Boldface sans serif type will be used throughout to indicate that a function is 


complex-valued. 
2 When a single limit of integration appears above or below a double integral, 


then that limit applies to both integrations. 
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form are very similar, differing only in the sign of the exponent appearing 
in the integrand. 

Before discussing the properties of the Fourier transform and its 
inverse, we must first decide when the definitions (2-1) and (2-2) are in 
fact meaningful. For certain functions, these integrals may not exist in 
the usual mathematical sense, and therefore this discussion would be 
incomplete without at least a brief mention of “existence conditions.” 
While a variety of sets of sufficient conditions for the existence of (2-1) 
are possible, perhaps the most common set is the following: 


1. g must be absolutely integrable over the infinite zy plane. 

2. g must have only a finite number of discontinuities and a finite 
number of maxima and minima in any finite rectangle. 

3. g must have no infinite discontinuities. 


In general, any one of these conditions can be weakened at the price of 
strengthening one or both of the companion conditions, but such con- 
siderations lead us rather far afield from our purposes here. 

As Bracewell [Ref. 2-2] has pointed out, “physical possibility is a 
valid sufficient condition for the existence of a transform.”’ However, it 
is often convenient in the analysis of systems to represent true physical 
waveforms by idealized mathematical functions, and for such functions, 
one or more of the above existence conditions may be violated. For 
example, it is common to represent a strong, narrow time pulse by the 
so-called Dirac 6 function,! defined by 


6(t) = lim N exp (—N?nt?) 
N- © 


Similarly, an idealized point source of light is often represented by the 
two-dimensional equivalent, 


b(x,y) = tue N?* exp [—N?a(z? + y?)] (2-3) 


Such functions, being infinite at the origin and zero elsewhere, have an 
infinite discontinuity and therefore fail to satisfy existence condition 3. 
Other important examples are readily found; for example, the functions 


f(x,y) abe and f(x,y) = COs (2nfx2) 


both fail to satisfy existence condition 1. 
Evidently, if the majority of functions of interest are to be included 
within the framework of Fourier analysis, some generalization of the 


‘ For a more detailed discussion of the 6 function, including alternative definitions, 
see Sec. A in the appendix. 
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definition (2-1) is required. Fortunately, it is often possible to find a 
meaningful transform of functions that do not strictly satisfy the exist- 
ence conditions, provided those functions can be defined as the limit of a 
sequence of functions that are transformable. By transforming each 
member function of the defining sequence, a corresponding sequence of 
transforms is generated, and we call the limit of this new sequence the 
generalized Fourier transform of the original function. Generalized trans- 
forms can be manipulated in the same manner as conventional trans- 
forms, and the distinction between the two cases can generally be ignored, 
it being understood that when a function fails to satisfy the existence 
conditions and yet is said to have a transform, then the generalized 
transform is actually meant. For a more detailed discussion of this 
generalization of Fourier analysis the reader is referred to the book by 
Lighthill [Ref. 2-3]. 

To illustrate the calculation of a generalized transform, consider 
the Dirac 6 function, which has been seen to violate existence condition 
3. Note that each member function of the defining sequence (2-3) does 
satisfy the existence requirements and that each, in fact, has a Fourier 
transform given by (see Table 2-1) 


§{N? exp [—N?r(x? + y?)]} = exp | - 


RES E ad 
N? 


Accordingly the generalized transform of 6(z,y) is found to be 


r(fx? + ich as wd 


5{6(x,y)} = lim {exp | - a 


N- © 
Evidently the spectrum of a 6 function extends uniformly over the entire 
frequency domain. 
For other examples of generalized transforms see Table 2-1. 


The Fourier transform as a decomposition 


As mentioned previously, when dealing with linear systems it is often 
useful to decompose a complicated input into a number of more simple 
inputs, to calculate the response of the system to each of these “elemen- 
tary” functions, and to superimpose the individual responses to find the 
total response. Fourier analysis provides a basic means of performing one 
such decomposition. Consider the familiar inverse-transform relationship 


g(t) = [7, GU) exp (i2nft) df 


expressing the time function g in terms of its frequency spectrum. We 
may regard this expression as a decomposition of the function g(t) into a 
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re 
Lines of zero phase 


Figure 2-1 Lines of zero phase for the function exp [j21(fxx + fry)]. 


linear combination (i.e., an integral) of elementary functions, each with 
the specific form exp (j2zft). Evidently the complex number G(f) is 
simply a weighting factor that must be applied to the elementary function 
of frequency f in order to synthesize the desired g(t). 

In a similar fashion, we may regard the two-dimensional Fourier 
transform as a decomposition of a function g(x,y) into a linear combina- 
tion of elementary functions of the form exp [j27(fxx + fyy)]. Such 
functions have a number of interesting properties. Note that for any 
particular frequency pair (fx,fy), the corresponding elementary function 
has zero phase along lines described by 


n 
y= Jx x+— (n an integer) 
fr fy 


Thus, as indicated in Fig. 2-1, this elementary function may be regarded 
as being “directed” in the zy plane at an angle @ (with respect to the 
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x axis) given by 
ty 


xX 


6 = tan—! (2-5) 
In addition, the spatial period (i.e., the distance between zero-phase lines) 
is evidently given by 


1 
Vist + fe 


In conclusion, then, we may again regard the inverse Fourier trans- 
form as providing a means of decomposing mathematical functions. The 
Fourier spectrum G of a function g is simply a description of the weight- 
ing factors that must be applied to each elementary function in order to 
synthesize the desired g. The real advantage to using this decomposition 
will not be fully evident until our later discussion of invariant linear 
systems. 


L (2-6) 


Fourier transform theorems 


The basic definition (2-1) of the Fourier transform leads to a rich mathe- 
matical structure associated with the transform operation. We now 
consider a few of the basic mathematical properties of the transform, 
properties that will find wide use in later material. These properties are 
presented as mathematical theorems, followed by a brief statement of 
their physical significance. Since these theorems are direct extensions of 
the analogous one-dimensional statements, the proofs are deferred to 
the appendix. 


1. Linearity theorem. S{ag + Bh} = aS{g} + BF{h}; that is, the trans- 
form of a sum of two functions is simply the sum of their individual 
transforms. 

2. Similarity theorem. If S{g(x,y)} = G(fx,fy), then 


ib ve pe 
F{g(ar,bu)} = jy G (2,2) 


that is, a “stretching” of the coordinates in the space domain (z,y) 
results in a contraction of the coordinates in the frequency domain 
(fx,fy), plus a change in the overall amplitude of the spectrum. 

3. Shift theorem. If S{g(x,y)} = G(fx,fy), then 


{g(x — a, y — b)} = G(fx,fr) exp [—j2a(fxa + frbd)] 


that is, translation of a function in the space domain introduces a 
linear phase shift in the frequency domain. 
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4, 


Parseval’s theorem. If S{g(z,y)} = G(fx,fy), then 


[flow ax ay = ff \GGxfy)I? dfx afy 
This theorem is generally interpretable as a statement of conserva- 
tion of energy. 

Convolution theorem. If F{g(zx,y)} = G(fx,fy) and 


F{h(x,y) } a H(fx,fy) 
then 


5 | i g(én)h(x — & y — 0) dé dn) = G(fx.fyHfxofy) 


The convolution of two functions in the space domain (an operation 
that will be found to arise frequently in the theory of linear systems) 
is entirely equivalent to the more simple operation of multiplying 
their individual transforms. 

Autocorrelation theorem. If S{g(z,y)} = G(fx,fy), then 


5 i g(éna*(E — 2,1 — y) dé dn} = |G(fx,fy)|" 
Similarly, 


F{la(tml} = ff GlEmG*(E — fx, 1 — fy) aE dy 


This theorem may be regarded as a special case of the convolution 
theorem. 
Fourier integral theorem. At each point of continuity of g 


55—"{g(z,y)} = F'F{g(x,y)} = g(x,y) 


At each point of discontinuity of g, the two successive transforms 
yield the angular average of the value of g in a small neighborhood 
of that point. That is, the successive transformation and inverse 
transformation of a function yields that function again, except at 
points of discontinuity. 


The above transform theorems are of far more than just theoretical 


interest. They will be used frequently, since they provide basic tools for 
the manipulation of Fourier transforms and can save enormous amounts 
of work in the solution of Fourier analysis problems. 
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Separable functions 


A function of two independent variables is called separable with respect 
to a specific coordinate system if it can be written as a product of two 
functions, each of which depends on only one independent variable. 
Thus a function g is separable in the rectangular coordinates (x,y) if 


g(x,y) = gx(x)gr(y) (2-7) 
while it is separable in polar coordinates (r,@) if 
g(7,8) = gr(r)go(9) (2-8) 


Separable functions are often more convenient to deal with than 
more general functions, for separability often allows complicated two- 
dimensional manipulations to be reduced to more simple one-dimensional 
manipulations. For example, a function separable in rectangular coor- 
dinates has the particularly simple property that its two-dimensional 
Fourier transform can be found as a product of one-dimensional Fourier 
transforms, as evidenced by the following relation: 


Fi g(z,y)} ik g(x,y) exp [—j2m(fxa + fry)] dx dy 


[2 ox(@) exp [—j2nfxa] de [”_ gv(y) exp [—j2afryl dy 
Fxlgx}Frigv} (2-9) 


Thus the transform of g is itself separable into a product of two factors, 
one a function of fx only and the second a function of fy only, and the 
process of two-dimensional transformation simplifies to a succession of 
more familiar one-dimensional manipulations. 

Functions separable in polar coordinates are not so easily handled 
as those separable in rectangular coordinates, but it is still generally 
possible to demonstrate that two-dimensional manipulations can be 
performed by means of a series of one-dimensional manipulations. For 
example, the process of Fourier transforming a function separable in 
polar coordinates is considered in the problems (see Prob. 2-7), where 
the reader is asked to verify that the two-dimensional spectrum can be 
found by performing a series of one-dimensional operations called Hankel 
transforms. 


Functions with circular symmetry: Fourier-Bessel transforms 


Perhaps the simplest class of functions separable in polar coordinates is 
composed of those possessing circular symmetry. The function g is said to 
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be circularly symmetric if it can be written as a function of radius r 
alone, that is, 
g(7,8) = gr(r) (2-10) 

Such functions play a particularly important role in the problems of 
interest here, since most optical systems have precisely this type of 
symmetry. We accordingly devote special attention to the problem of 
Fourier transforming a circularly symmetric function. 

The Fourier transform of g in a system of rectangular coordinates 
is, of course, given by 


G(fx.fy) = ff a(eu) exp (—j2e(fxx + fry) dedy (2-11) 


To fully exploit the circular symmetry of g, we make a transformation to 
polar coordinates in both the zy and fxfy planes as follows: 


r=Vo27-+ 7 x =rcosé@ 


1 
6 = tan"! (“) y =rsin 6 
ay 


2-12 
Pi=M fx* J r* fx =pcos¢ ( ) 


simi) 


For the present we write the transform as a function of both radius and 
angle, 


psin d 


F{g} = Golp,¢) 


Applying the coordinate transformations (2-12) to Eq. (2-11), the 
Fourier transform of g can be written 


2a 


G.(p,¢) = i dé ii dr - rgr(r) exp [—j2mrp(cos 6 cos ¢ + sin @ sin ¢)] 
or equivalently, 
) 2r t 

Gi(p,¢) = ik dr: rgr(r) if dé exp [—j2mrp cos (@ — ¢)] (2-13) 

Finally, we use the Bessel function identity 
1 2a 
vod) = oe ip exp [—ja cos (8 — ¢)] da (2-14) 

where Jo is a Bessel function of the first kind, zero order, to simplify the 


expression for the transform. Substituting (2-14) in (2-13), the depend- 
ence of the transform on angle ¢ is seen to disappear, leaving Go as the 
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following function of radius p, 


Gini. if * rge(r)Jo(2erp) dr (2-15) 


Thus the Fourier transform of a circularly symmetric function is 
itself circularly symmetric and can be found by performing the one- 
dimensional manipulation (2-15). This particular form of the Fourier 
transform occurs frequently enough to warrant a special designation; 
the expression (2-15) is accordingly referred to as the Fourier-Bessel 
transform, or alternatively, as the Hankel transform of zero order. For 
brevity we adopt the former terminology. 

By means of arguments identical with those used above, the inverse 
Fourier transform of a circularly symmetric function Go(p) can be 
expressed as 


gx(r) = 2 [> pGo(p)Fo(2nrp) dp 


Thus, for circularly symmetric functions there is no difference between 
the transform and inverse-transform operations. 

Using the notation ®{ } to represent the Fourier-Bessel transform 
operation, it follows directly from the Fourier integral theorem that 


BB {gr(r)} = BB{gr(r)} = ga(r) 


at each value of r where gr(r) is continuous. In addition, the s¢milarity 
theorem can be straightforwardly applied (see Prob. 2-4) to show that 


Bl ge(ar)} = 5G (*) 


When using the expression (2-15) for the Fourier-Bessel transform, 
the reader should remember that it is no more than a special case of the 
two-dimensional Fourier transform, and therefore any familiar property 
of the Fourier transform has an entirely equivalent counterpart in the 
terminology of Fourier-Bessel transforms. 


Some frequently used functions and some useful Fourier transform pairs 


A number of mathematical functions will find such extensive use in later 
material that considerable time and effort can be saved by assigning 
them special notations of their own. Accordingly, we adopt the following 
definitions of some frequently used functions: 


Rectangle function 
1 |el< 


rect (z) = 0 otherwise 
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Sinc function 


in 
sinc(z) = Ee 
Tx 
Sign function 
1 a) 
sen(r) = 4 0 Zi 
—1 Tia 
Triangle function 
tz) co cl 
AO {4 otherwise 


Comb function 
comb(t) = )) d(x — n) 
Circle function 


ne te tem eece 


0 otherwise 


The first five of these functions, depicted in Fig. 2-2, are all functions of 
only one independent variable; however, a variety of separable functions 
on a two-dimensional space can be formed by means of products of these 
functions. The circle function is, of course, unique to the case of two 
independent variables; see Fig. 2-3 for an illustration of its structure. 

We conclude our discussion of Fourier analysis by presenting some 
specific two-dimensional transform pairs. Table 2-1 lists a number of 
transforms of functions separable in rectangular coordinates. Since the 
transforms of such functions can be found directly from products of 


Table 2-1 Transform pairs for some functions 
separable in rectangular coordinates 


Function Transform 
exp [—7(x? + y?)] exp [—m(fx? + fr?)] 
rect(x) rect(y) sinc(fx) sine(fy) 
A(x) A(y) sinc?(fx) sine?(fy) 
6(2x,y) 1 
exp [jr(x + y)] O(fx —14, Jy —%3) 

i 1 


sgn(x) sgn(y) So 
‘cece Iuhx jufy 


comb(x) comb(y) comb(fx) comb(fr) 
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rect (2) 


1 
2 


comb (z) 


3-2 -1 0) 1 2 3 
Figure 2-2 Special functions. 


familiar one-dimensional transforms, the proofs of these relations are 
left to the reader (see Prob. 2-2). 

On the other hand, transforms of most circularly symmetric func- 
tions cannot be found simply from a knowledge of one-dimensional 
transforms. The most frequently encountered function with circular 
symmetry is: 

circ(r) = | d Weed : 
0 otherwise 


Accordingly, some effort is now devoted to finding the transform of this 
function. Using the Fourier-Bessel transform expression (2-15), the 
transform of the circle function can be written 


1 
@{cire(r)} = anf, rJ o(2rrp) dr 
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Figure 2-3 The circle function and its transform. 
Using a change of variables, 7’ = 2rrp, and the identity 
fF sFol®) dé = xi) 


we rewrite the transform as 


21p a Ji(27p) (2-16) 
p 


; 1 
@{eire(r)} = Sap ih r’J o(r’) dr’ 


where J; is a Bessel function of the first kind, order one. Figure 2-3 
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illustrates the circle function and its transform. Note that the transform is 
circularly symmetric, as expected, and consists of a central spike and a 
series of concentric rings of diminishing amplitude. As a matter of 
curiosity we note that the zeros of this transform are not equally spaced 
in radius. For a number of additional Fourier-Bessel transform pairs, 
the reader is referred to the problems (see Prob. 2-4). 


2-2, LINEAR SYSTEMS 


For the purposes of our discussions here, we seek to define the word 
system in a way sufficiently general to include both the familiar case of 
electrical networks and the less-familiar case of optical imaging devices. 
Accordingly, a system is defined to be a mapping of a set of input functions 
into a set of output functions. For the case of electrical networks, the 
inputs and outputs are real functions (voltages or currents) of a one- 
dimensional independent variable (time) ; for the case of imaging systems, 
the inputs and outputs can be real-valued functions (intensity) or 
complex-valued functions (field amplitude) of a two-dimensional inde- 
pendent variable (space). As mentioned previously, the question of 
whether the intensity or the field amplitude should be considered the 
system variable will be treated at a later time. 

If attention is restricted to deterministic (nonrandom) systems, 
then a specified input must map into a unique output. It is not necessary, 
however, that each output correspond to a unique input, for as we shall 
see, a variety of input functions can produce no output. Thus we restrict 
attention at the outset to systems characterized by many-one mappings. 

A convenient representation of a system is a mathematical operator, 
S{  }, which we imagine to operate on input functions to produce output 
functions. Thus, if the function gi(x1,y1) represents the input to a system, 
and go(22,y2) represents the corresponding output, then by the definition 
of §{ }, the two functions are related through 


G2(X2,Y2) = S{gilxi,y1)} (2-17) 


Without specifying more detailed properties of the operator §{ }, 
it is difficult to state more specific properties of a general system than 
those expressed by Eq. (2-17). In the material that follows, we shall be 
concerned primarily, though not exclusively, with a restricted class of 
systems that are said to be linear. The assumption of linearity will be 
found to yield simple and physically meaningful representations of such 
systems; it will also allow useful relations between inputs and outputs 
to be developed. 
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Linearity and the superposition integral 


A system is said to be linear if the following superposition property is 
obeyed for all input functions t and s and all complex constants a and b: 


S{as(x1,y1) + bt(ri,y1)} = aS {s(ai,yi)} + bS{t(ri,y1)} (2-18) 


As mentioned previously, the great advantage afforded by linearity is the 
ability to express the response of the system to an arbitrary input in 
terms of the responses to certain ‘‘elementary” functions into which the 
input has been decomposed. It is most important, then, to find a simple 
and convenient means of decomposing the input. Such a decomposition is 
offered by the so-called sifting property of the 6 function (cf. See. A in the 
appendix), which states that 


ai(euys) = ff arlEn)oer — £ yx — 9) dE dr (2-19) 


This equation may be regarded as expressing g; as a linear combination 
of weighted and displaced 6 functions; the elementary functions of the 
decomposition are, of course, just these 6 functions. 

To find the response of the system to the input gi, substitute (2-19) 
in (2-17): 


92(12,y2) = § (ff gilé,n)d(v1 — &, yr — ) dé dn} 


Now, regarding the number g:(é,y) as simply a weighting factor applied 
to the elementary function 6(11 — &, y: — 7), the linearity property (2-18) 
is invoked to allow §{ } to operate on the individual elementary 
functions; thus the operator §{ } is brought within the integral, yielding 


ga(zays) = ff arltsn)S{a(er — & yn — 2)} dE ay 


As a final step we let the symbol h(x2,y2;£,) denote the response of the 
system at point (x2,¥2) of the output space to a 6 function input at coor- 
dinates (é,n) of the input space; that is, 


h(x2,yo;En) = ${8(t1 — & yi — 0)} (2-20) 


The function h is called the impulse response of the system. The system 
input and output can now be related by the simple equation 


ga(zays) = [ff ax(Enh(ws,y2stn) dé dy (2-21) 
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This fundamental expression, known as the superposition integral, 
demonstrates the very important fact that a linear system is completely 
characterized by its response to unit impulses. To completely specify 
the output, the responses must in general be known for impulses located 
at all possible points in the input plane. For the case of a linear imaging 
system, this result has the interesting physical interpretation that the 
effects of imaging elements (lenses, stops, etc.) can be fully described by 
specifying the (possibly complex-valued) images of point sources located 
throughout the object field. 


Invariant linear systems: transfer functions 


Having examined the input-output relations for a general linear system, 
we turn now to an important subclass of linear systems, namely, invariant 
linear systems. An electrical network is said to be time-invariant if its 
impulse response A(t;r) (that is, its response at time ¢ to a unit-impulse 
excitation applied at time r) depends only on the time difference (¢ — 7). 
Electrical networks composed of fixed resistors, capacitors, and inductors 
are time-invariant since their characteristics do not change with time. 

In a similar fashion, a linear imaging system is said to be space- 
tnvariant (or equivalently, zsoplanatic) if its impulse response h(2x2,y2;£,n) 
depends only on the distances (x. — £&) and (yz — 7). For such a system 
we can, of course, write 


h(x2,y2;&n) = h(xe — E, y2 — n) (2-22) 


Thus an imaging system is space-invariant if the image of a point-source 
object changes only in location, not in functional form, as the point 
source explores the object field. In practice, imaging systems are seldom 
isoplanatic over their object field, but it is usually possible to divide the 
object field into small regions (isoplanatic patches) within which the 
system is approximately invariant. To completely describe the imaging 
system, the-impulse response appropriate to each isoplanatic patch should 
be specified; but if the particular portion of the object field of interest is 
sufficiently small, it often suffices to consider only the isoplanatic patch 
on the axis of the system. Note that for an invariant system the super- 
position integral (2-21) takes on the particularly simple form 


go(rays) = ff ailémh(es — & ye — 1) dé dn (2-28) 


which we recognize as a two-dimensional convolution of the object function 
with the impulse response of the system. In the future it will be convenient 
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to have a shorthand notation for a convolution relation such as (2-23), 
and accordingly this equation is rewritten 


g2 =gixh 


where an asterisk between any two functions is a convenient symbol 
indicating that those functions are to be convolved. 

The class of invariant linear systems has associated with it a far 
more detailed mathematical structure than the more general class of all 
linear systems, and it is precisely because of this structure that invariant 
systems are so easily dealt with. The simplicity of invariant systems 
begins to be evident when we note that the convolution relation (2-23) 
takes on a particularly simple form after Fourier transformation. Spe- 
cifically, transforming both sides of (2-23) and invoking the convolution 
theorem, the spectra G2(fx,fy) and Gi(fx,fy) of the system output and 
input are seen to be related by the simple equation 


G2(fx,fy) = H(fx,fy)Gilfx,fr) (2-24) 


where H is the Fourier transform of the impulse response 


H(fx,fv) = ff h(én) exp [—j2e(fxé + frn)]dedn (2-25) 


The function H, called the transfer function of the system, indicates the 
effects of the system in the ‘‘frequency domain.”’ Note that the relatively 
tedious convolution operation (2-23) required to find the system output 
is replaced in (2-24) by the often more simple sequence of Fourier 
transformation, multiplication of transforms, and inverse Fourier 
transformation. 

From another point of view, we may regard the relations (2-24) and 
(2-25) as indicating that, for linear invariant systems, the input can be 
decomposed into elementary functions that are more convenient than 
the 6 functions of Eq. (2-19). These alternative elementary functions are, 
of course, the complex-exponential functions. By transforming g; we are 
simply decomposing the input into complex-exponential functions of 
various spatial frequencies (fx,fy). Multiplication of the input spectrum 
G, by the transfer function H then takes into account the effects of the 
system on each elementary function. Note that these effects are limited 
to an amplitude change and a phase shift, as evidenced by the fact that 
we simply multiply the input spectrum by a complex number H(fx,fy) 
at each (fx,fy). Inverse transformation of the output spectrum G, simply 


synthesizes the output g2 by adding up all the modified elementary 
functions. 
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Finally, it should be strongly emphasized that the simplifications 
afforded by transfer-function theory are only applicable for invariant 
linear systems. For applications of Fourier theory in the analysis of 
time-varying electrical networks, the reader may consult Ref. 2-4; 
applications of Fourier analysis to space-variant imaging systems can be 
found in Ref. 2-5. 


2-3 TWO-DIMENSIONAL SAMPLING THEORY 


It is often convenient, both for data processing and for mathematical 
analysis purposes, to represent a function g(z,y) by an array of its 
sampled values taken on a discrete set of points in the zy plane. Intu- 
itively, it is clear that if these samples are taken sufficiently close to each 
other, the sampled data are an accurate representation of the original 
function in the sense that g can be reconstructed with considerable 
accuracy by simple interpolation. It is a less obvious fact that for a 
particular class of functions (known as bandlimited functions) the recon- 
struction can be accomplished exactly, providing only that the interval 
between samples is not greater than a certain limit. This result was 
originally pointed out by Whittaker [Ref. 2-6] and was later popularized 
by Shannon [Ref. 2-7] in his studies of information theory. 

The sampling theorem applies to the class of bandlimited functions, 
by which we mean functions with Fourier transforms that are nonzero 
over only a finite region ® of the frequency space. We consider first a 
form of this theorem that is directly analogous to the one-dimensional 
theorem used by Shannon. Later we very briefly indicate improvements 
of this theorem that can be made in some two-dimensional cases. 


The Whittaker-Shannon sampling theorem 


To derive what is perhaps the simplest version of the sampling theorem, 
we consider a rectangular lattice of samples of the function g, as defined by 


g;(x,y) = comb ) comb (4) g(x,y) (2-26) 


The sampled function g, thus consists of an array of 6 functions, spaced 
at intervals of width X in the z direction and width Y in the y direction 
as illustrated in Fig. 2-4. The area under each 6 function is proportional 
to the value of the function g at that particular point in the rectangular 
sampling lattice. As implied by the convolution theorem, the spectrum G, 
of g, can be found by convolving the transform of comb(«/X) comb(y/Y) 
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Figure 2-4 The sampled function. 


with the transform of g, or 


G.(fxfy) = {comb (=) comb (2)| * G(fx,fy) 


where the asterisk again indicates that a convolution is to be performed. 
Now using Table 2-1 and the similarity theorem, we have 


§ {comb (=) comb (2)| = XY comb(Xfx) comb(Yfy) 


while from the results of Prob. 2-10, 


XY comb(Xfx) comb(¥fr)= Y Y 6 @ — ir — a 


Te 08! 1 —— 00) 


It follows that the spectrum of the sampled function is given by 


CiGzfy) Se) atc (4s si = a= =) (2-27) 


n=—-eo m=—o 


Evidently the spectrum of g, can be found simply by erecting the spec- 


trum of g about each point (n/X,m/Y) in the fxfy plane as shown in 
Fig. 2-5. 
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Since the function g is assumed to be bandlimited, its spectrum G 
is nonzero over only a finite region ® of the frequency space. As implied 
by Eq. (2-27), the region over which the spectrum of the sampled function 
is nonzero can be found by constructing the region ® about each point 
(n/X,m/Y) in the frequency plane. Now it becomes clear that if X and 
Y are sufficiently small (i.e., the samples are sufficiently close together), 
then the separations 1/X and 1/Y of the various spectral regions will be 
great enough to assure that adjacent regions do not overlap (see Fig. 2-5). 
Thus recovery of the original spectrum G from G, can be accomplished 
exactly by passing the sampled function g, through a linear filter that 
transmits the term (n = 0, m = 0) of Eq. (2-27) without distortion, 
while perfectly excluding all other terms. Thus, at the output of this 
filter we find an exact replica of the original data g(z,y). 

As stated in the above discussion, to successfully recover the original 
data it is necessary to take the samples close enough together to enable 
separation of the various spectral regions of G,. To determine the max- 
imum allowable separation between samples, let 2Bx and 2By represent 
the widths in the fx and fy directions, respectively, of the smallest rec- 
tangle! that completely encloses the region ®. Since the various terms in 
the spectrum (2-27) of the sampled data are separated by distances 
1/X and 1/Y in the fx and fy directions, respectively, separation of the 
spectral regions is assured if 


(2-28) 


1 For simplicity we assume that this rectangle is centered on the origin. If this 
is not the case, the arguments can be modified in a straightforward manner to yield 
a somewhat more efficient sampling theorem. 


(a) 
Figure 2-5 Spectra of (a) the original function and (b) the sampled data. 
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The maximum spacings of the sample lattice for exact recovery of the 
original function are thus (2Bx)~! and (2By)7}. 

Having determined the maximum allowable distances between 
samples, it remains to specify the exact transfer function of the filter 
through which the sampled data should be passed. In many cases there 
is considerable latitude of choice here, since for many possible shapes of 
the region ® there are a multitude of transfer functions that will pass the 
(n = 0, m = 0) term of G, and exclude all other terms. For our purposes, 
however, it suffices to note that if the relations (2-28) are satisfied, there 
is one transfer function that will always yield the desired result regardless 
of the specific shape of ®, namely, 


H(fx,fy) = rect (=) rect (=) (2-29) 


The exact recovery of G from G, is seen by noting that the spectrum 
of the output of such a filter is 


G.(fx,fy) rect (= -) rect (= -) = G(fx,fr) 


The equivalent identity in the space domain is 


| comb (=) comb (2) acu) | *h(z,y) = g(x,y) (2-30) 


where h is the impulse response of the filter 


h(z,y) 


I rect (=) rect (= ns -) exp [j2r(fxx + fry)] dfx dfy 
= 4BxBy sinc(2Bxz) sine(2Byy) 


Noting that 


comb (=) comb (7) g(x,y) 


= Xa * > g(nX,mY) d(x — nX, y — mY) 
Eq. (2-30) becomes 


g(x,y) = 4BxyByXY 


> » g(nX,mY) since[2Bx(x — nX)] sinc[2By(y — mY)] 


n=—o m=—o 
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Finally, when the sampling intervals X and Y are taken to have their 
maximum allowable values, the identity becomes 


g(x,y) = y y a(t *) 


n=—0 m=—o 


5 n : m 
sinc |2Bs (: - =) sine | 28 (u — om) (2-31) 


Equation (2-31) represents a fundamental result which we shall refer 
to as the Whittaker-Shannon sampling theorem. It implies that exact 
recovery of a bandlimited function can be achieved from an appropriately 
spaced rectangular array of its sampled values; the recovery is accom- 
plished by injecting, at each sample point, an interpolation function 
consisting of a product of sine functions. 

The above result is by no means the only possible sampling theorem. 
Two rather arbitrary choices were made in the analysis, and alternative 
assumptions at these two points will yield alternative sampling theorems. 
The first arbitrary choice, appearing early in the analysis, was the use of 
a rectangular sampling lattice. The second, somewhat later in the analysis, 
was the choice of the particular transfer function (2-29). Alternative 
theorems derived by making different choices at these two points are, 
of course, no less valid than Eq. (2-31); in fact, in some cases alternative 
theorems can be more “‘efficient”’ in the sense that fewer samples per unit 
area are required to assure complete recovery. The reader interested in 
further pursuing this extra richness of the multidimensional sampling 
theory is referred to the works of Bracewell [Ref. 2-8] and of Peterson 
and Middleton [Ref. 2-9]. In addition, sampling theorems involving the 
values of derivatives of the function as well as the function itself have 
been discussed by Linden [Ref. 2-10]. 


PROBLEMS 
2-1 Prove the following properties of 6 functions: 


1 
(a) d(az,by) = — 4(2,y) 
|ab| 


(b) comb(axr) comb(by) = a a ay (2-2 a y— =) 


2-2. Prove the following Fourier transform relations: 
(a) F{rect(x) rect(y)} = sinc( fx) sinc( fy) 
(b) F{A(ax) A(y)} = since?(fx) sinc*( fy) 
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2-3 


2-6 


Prove the following generalized Fourier transform relations: 


(c) F{1} = d(fr,fy) 


(aisiseneysent(yiee (Me) (es 
) F{sgn(x) sgn(y)} = Geis ) Nee 


Prove the following Fourier transform theorems: 
(a) FF {g(z,y)} = FF {g(z,y)} = g(—z,—y) at all points of continuity of g. 
(b) F{g(x,y)h(x,y)} = Flg(x,y)} * F{h(z,y) } 


(c) F{V%g(z,y)} = —4r2(fx? + fy?) F{g(z,y) } where V? is the laplacian operator 
oe? oe? 
Vie eas 
ox? dy? 


Prove the following Fourier-Bessel transform relations: 
(a) If gr(r) = 6(r — ro), then 


@{gr(r)} = 2rroJ o(2rrop) 
(b) If gr(r) = 1 for a < r < I and zero otherwise, then 


Ji(2xp) — aJi(2rap) 
p 


B{gz(r)} = 
(c) If B{gr(r)} = G(p), then 
®{gr(ar)} = =G (*) 
a a 
(d) @lexp (—mr?)} = exp ( —7zp?) 


The expression 


p(z,y) = g(x,y) * | com (2) comb (4) 


defines a periodic function, with period X in the z direction and period Y in the 
y direction. 
(a) Show that the Fourier transform of p can be written 


rimtr= YY 6 (BF )a(te- 2-2) 


n=—-2 m=—o 


where G is the transform of g. 
(b) Sketch the function p(z,y) when 


zx y 
yy) = t{ 2— = 
g(2z,y) rec ( :) rect (2 ) 


and find the corresponding transform P(fx,fy). 


Let the transform operators ¥4{ } and $s{ } be defined by 


1 ¥ Qa 
Falg} as lf 9(é,n) exp | i (ret + tem Jaca 


1 ' 1 
Fa{g} aE lf sen) exp| 5 ct + um [asa 
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(a) Find a simple expression for 
Fa{Fala(x,y)}} 
(b) Interpret the results fora > b anda < b. 


2-7 Let g(r,@) be separable in polar coordinates. 
(a) Show that if g(7,6) = gr(r)e’™®, then 


F{a(r,6)} = (—7)™e"?Hm {gr(r) } 


where Hm{  } is a Hankel transform of order m, 


KRm{gr(r)} = 2r fe rgr(7)J m(2nrp) dr 


and (p,¢) are polar coordinates in the frequency space. 


o 


HINT: exp (jasin x) = » Ji(a) exp (jka) 
k=—o© 


(6) Show that for a more general case of an arbitrary angular dependence ga(6), 
the Fourier transform can be expressed by the following infinite series of Hankel 
transforms: 


$(9(r,8)} = D) col —j)Fe*6ICg fan) } 


k=—© 


1 2a 
where Cc = — i ga(0)e—7*9 da 
2r JO 


2-8 Suppose that a sinusoidal input 
g(x,y) = cos [2n(fxx + fry)] 


is applied to a linear system. Under what (sufficient) condition is the output a 
real sinusoidal function of the same spatial frequency as the input? Express 
the amplitude and phase of that output in terms of an appropriate character- 
istic of the system. 


2-9 Show that a function with no nonzero spectral components outside a circle of 
radius B in the frequency plane obeys the following sampling theorem: 


2 2 
orB eat wees 
wy (te ne 2 55 € ws) | 
g(x,y) = > 32 2B 2B) )2 iN" = ay 
n=—0 m=—o 208 (=~ 55) + (Ga) 


2B 2B 


2-10 The Fourier transform operator may be regarded as a mapping of functions 
into their transforms and therefore satisfies the definition of a system as pre- 
sented in this chapter. 

(a) Is this system linear? 
(b) Can you specify a transfer function that characterizes this system? If so, 
what is it? If not, why not? 
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2-11 


2-12 


2-13 


The “equivalent area’ Axy of a function g(z,y) can be defined by 


ff a(2,y) de dy 


Axy = 
= 9(0,0) 


while the ‘equivalent bandwidth” of g is defined in terms of its transform G by 


ff G( fx,fy) dfx dfy 


A = 
Sxfy G(0,0) 


Show that AxyAsyry = 1 


A certain complex-valued function of two independent variables (x,y) has a 
spatial Fourier transform that-is identically zero outside the region |fx| < Bx, 
|fy| < By in the frequency domain. Show that the portion of this function 
extending over the region |z| < X, |y| < Y in the space domain can be specified 
(approximately) by 32BxByXY real numbers. Why is this only an approxima- 
tion, and when will it be a good one? (The number 16BxByXY is commonly 
called the space-bandwidth product of the portion of the function considered.) 


The input to a certain imaging system is an object complex-field distribution 
U.(z,y) of unlimited spatial frequency content, while the output of the system 
is an image field distribution U;(z,y). The imaging system can be assumed to 
act as a linear space-invariant lowpass filter with a transfer function that is 
identically zero outside the region |fx| < Bx, |fy| < By in the frequency 
domain. Show that there exists an ‘“‘equivalent’’ object U(z,y), consisting of a 
rectangular array of point sources, that produces exactly the same image U; 
as does the true object U,, and that the field distribution across the equivalent 
object can be written 


U,(2,y) = > > | fe U.(£,n) sine(n — 2Bxé) sine(m — 2Byn) dé an | 


n=—-27m=—o 


5 d n m 
7) QB et Mal Sis 
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3 FOUNDATIONS OF SCALAR 
DIFFRACTION THEORY 


The phenomenon known as diffraction plays a role of the utmost impor- 
tance in the branches of physics and engineering that deal with wave 
propagation. In this chapter we consider some of the foundations of 
scalar diffraction theory. While the theory discussed here is sufficiently 
general to be applied in other fields, such as acoustic-wave propagation 
and radio-wave propagation, the applications of primary concern will be 
in the realm of physical optics. To fully understand the properties of 
optical imaging and optical data-processing systems, it is essential that 
diffraction and the limitations it imposes on system performance be 
appreciated. A variety of references to more comprehensive treatments 
of diffraction theory will be found in the material that follows. 


3-1 HISTORICAL INTRODUCTION 


The term diffraction has been conveniently defined by Sommerfeld 
[Ref. 3-1] as “any deviation of light rays from rectilinear paths which 
cannot be interpreted as reflection or refraction.’’ The first accurate 
report and description of such a phenomenon was made by Grimaldi 
and was published in the year 1665, shortly after his death. The measure- 
ments reported were made with an experimental apparatus similar to that 
shown in Fig. 3-1: an aperture in an opaque screen was illuminated by a 
light source, and the light intensity was observed across a plane some 
distance behind the screen. The corpuscular theory of light propagation, 
which was the accepted means of explaining optical phenomena at the 
time, predicted that the shadow behind the screen should be well defined, 
with sharp borders. Grimaldi’s observations indicated, however, that the 
transition from light to shadow was gradual rather than abrupt. If the 
quality of his light source had been better, he might have observed even 
more striking results, such as the presence of light and dark fringes 
extending far into the geometrical shadow of the screen. Such effects 
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Figure 3-1 Apparatus for observing diffraction of light. 


cannot be satisfactorily explained by a corpuscular theory of light, which 
requires rectilinear propagation of light rays in the absence of reflection 
and refraction. 

The initial step in the evolution of a theory that would explain such 
effects was made by the first proponent of the wave theory of light, 
Christian Huygens, in the year 1678. Huygens expressed an intuitive 
conviction that if each point on the wavefront of a light disturbance were 
considered to be a new source of a “secondary” spherical disturbance, 
then the wavefront at any later instant could be found by constructing 
the “envelope” of the secondary wavelets, as illustrated in Fig. 3-2. 


SS 


Secondary wavelets 


Envelope 
(new wavefront) 


Primary wavefront 


ee 


_ 


Figure 8-2 Huygens’ envelope construction. 
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Huygens’ intuitive ideas were greatly improved in 1818 in the famous 
memoir of Augustin Jean Fresnel, who supplemented Huygens’ envelope 
construction with Young’s principle of interference. By making some 
rather arbitrary assumptions about the effective amplitudes and phases 
of Huygens’ secondary sources, and by allowing the various wavelets to 
mutually interfere, Fresnel was able to calculate the distribution of light 
in diffraction patterns with excellent accuracy. 

The ideas of Huygens and Fresnel were put on a firmer mathematical 
foundation in 1882 by Gustav Kirchhoff, who succeeded in showing that 
the amplitudes and phases ascribed to the secondary sources by Fresnel 
were indeed logical consequences of the wave nature of light. Kirchhoff 
based his mathematical formulation upon two assumptions about the 
boundary values of the light incident on the surface of an obstacle placed 
in the way of the propagation of light. These assumptions were later 
proved inconsistent with each other, by Poincaré in 1892 and by Sommer- 
feld in 1894.! As a consequence of these criticisms, Kirchhoff’s formulation 
of the so-called Huygens-Fresnel principle must be regarded as a first 
approximation, although under most conditions it yields results that agree 
amazingly well with experiment. Kottler [Ref. 3-2] attempted to resolve 
the contradictions by reinterpreting Kirchhoff’s boundary value problem 
as a saltus problem, where saltus is a Latin word signifying a discon- 
tinuity or a jump of a function. The Kirchhoff theory was also modified 
by Sommerfeld, who eliminated one of the aforementioned assumptions 
concerning the light at the boundary by making use of the theory of 
Green’s functions. This so-called Rayletgh-Sommerfeld diffraction theory 
will be treated in Sec. 3-4. 

It should be emphasized from the start that the Kirchhoff and 
Rayleigh-Sommerfeld theories share certain major simplifications and 
approximations. Most important, light is treated as a scalar phenomenon; 
i.e., only the scalar amplitude of one transverse component of either the 
electric or the magnetic field is considered, it being assumed that any 
other components of interest can be treated independently in a similar 
fashion. Such an approach entirely neglects the fact that the various 
components of the electric and magnetic field vectors are coupled through 
Maxwell’s equations and cannot be treated independently. Fortunately, 
experiments in the microwave region of the spectrum [Ref. 3-3] have shown 
that the scalar theory yields very accurate results if. two conditions are 
met: (1) the diffracting aperture must be large compared with a wave- 
length, and (2) the diffracted fields must not be observed too close to the 
aperture. These conditions will be well satisfied in the problems treated 


1 For a more detailed discussion of these inconsistencies see Sec. 3-4. 
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here. For a more complete discussion of the applicability of the scalar 
theory in instrumental optics the reader may consult Ref. 3-4 (Sec. 8.4). 
Nonetheless, there do exist important problems for which the required 
conditions are not satisfied, for example in the theory of high-resolution 
diffraction gratings [Ref. 3-5]. Such problems are excluded from our 
considerations here, since the vectorial nature of the fields must be taken 
into account if reasonably accurate results are to be obtained. Vectorial 
generalizations of diffraction theory do exist, the first satisfactory treat- 
ment being due to Koittler [Ref. 3-6]. 

The first truly rigorous solution of a diffraction problem was given 
in 1896 by Sommerfeld [Ref. 3-7], who treated the two-dimensional case 
of a plane wave incident on an infinitesimally thin, perfectly conducting 
half plane. Kottler [Ref. 3-8] later compared Sommerfeld’s solution with 
the corresponding results of Kirchhoff’s scalar treatment. 

Needless to say, a historic introduction to a subject so widely men- 
tioned in the literature can hardly be considered complete. The reader 
is therefore referred to such comprehensive treatments of diffraction 
theory as, for example, Refs. 3-9, 3-10, and 3-11. 


3-2. MATHEMATICAL PRELIMINARIES 


Before embarking on a treatment of diffraction itself, we first consider a 
number of mathematical preliminaries that form the basis of the later 
diffraction-theory derivations. These initial discussions will also serve to 
introduce the notation to be used throughout. 


The Helmholtz equation 


Let the light disturbance at position P and time ¢ be represented by the 
scalar function u(P,t); for the case of linearly polarized waves, we may 
regard this function as representing either the electric or the magnetic 
field strength. Attention is restricted initially to the case of purely 
monochromatic waves, with the generalization to polychromatic waves 
being deferred to Sec. 3-5. 

For a monochromatic wave, the field may be written explicitly as 


u(P,t) = U(P) cos [2avt + $(P)] (a1) 


where U(P) and ¢(P) are the amplitude and phase, respectively, of the 
wave at position P, while » is the optical frequency. A more compact 
form of (3-1) is found by using complex notation, writing 


u(P,t) = Re [U(P) exp (—j277t)] (3-2) 
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where U(P) is the following complex function of position (sometimes 


called a phasor) 
U(P) = U(P) exp [—34(P)] (3-3) 


and Re is shorthand notation meaning the real part of. 
If the real disturbance u(P,t) is to represent an optical wave, it 
must satisfy the scalar wave equation 


V4 = a = (3-4) 


But the complex function U(P) serves as an adequate description of the 
disturbance, since the time dependence is known a priori. If (3-2) is 
substituted in (3-4), it follows that the complex disturbance U must obey 
the time-independent equation 


(Vv? + k?7)U =0 (3-5) 
where k is termed the wave number and is given by 


i 2— as 
rN 

The relation (3-5) is known as the Helmholtz equation; we may assume in 
the future that the complex amplitude of any monochromatic optical 


disturbance propagating through free space must obey such a relation. 


Green's theorem 


Calculation of the complex disturbance U at an observation point in 
space can be accomplished with the help of the mathematical relation 
known as Green’s theorem. This theorem, which will be found in most 
texts on advanced calculus [e.g., Ref. 3-12, p. 316], can be stated as 
follows: 


Green’s theorem. Let U(P) and G(P) be any two complex-valued functions 
of position, and let S be a closed surface surrounding a volume V. 
If U, G, and their first and second partial derivatives are single- 
valued and continuous within and on S, then we have 


J (GV°U — UV°G) )dn= [f (oo us) as (3-6) 
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Figure 3-3 Surface of integration. 


where 0/dn signifies a partial derivative in the outward normal 
direction at each point on S. 


This theorem is in many respects the prime foundation of scalar 
diffraction theory. However, only a prudent choice of a so-called Green’s 
function G and a closed surface S will allow its direct application to the 
diffraction problem. We turn now to the former of these problems, 
considering Kirchhoff’s choice of a Green’s function and the consequent 
integral theorem that follows. 


The integral theorem of Helmholtz and Kirchhoff 


The Kirchhoff formulation of the diffraction problem is based on a certain 
integral theorem which expresses the solution of the homogeneous wave 
equation at an arbitrary point in terms of the values of the solution and 
its first derivative on an arbitrary closed surface surrounding that point. 
This theorem had been derived previously in acoustics by H. von 
Helmholtz. 

Let the point of observation be denoted Po, and let S denote an 
arbitrary closed surface surrounding Po, as indicated in Fig. 3-3. The 
problem is, of course, to express the optical disturbance at Po in terms 
of its values on the surface S. To solve this problem, we follow Kirchhoff 
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in applying Green’s theorem and in choosing as a Green’s function G a 
unit-amplitude spherical wave expanding about the point Po (that is, 
the so-called free-space Green’s function). Thus, the value of G at an 
arbitrary point P; is given by 
exp (jkro1) 
Cae Sek mcmama ee (3-7) 


Yo. 


where we adopt the notation that ro is the length of the vector fo: pointing 
from Po to P. 

To be legitimately used in Green’s theorem, the function G (as well 
as its first and second derivatives) must be continuous within the enclosed 
volume V. Therefore, to exclude the discontinuity at Po, a small spherical 
surface S,, of radius e, is inserted about the point Po. Green’s theorem is 
then applied, the volume of integration V’ being that volume lying 
between S and S,, and the surface of integration being the composite 
surface 

=S+8, 
as indicated in Fig. 3-3. Note that the ‘‘outward”’ normal to the composite 
surface points outward in the conventional sense on S, but inward (toward 
Po) on S,. 

Within the volume V’, the disturbance G, being simply an expanding 
spherical wave, satisfies a Helmholtz equation 


(V2 + k)G =0 (3-8) 


Substituting the two Helmholtz equations (3-5) and (3-8) in the left-hand 
side of Green’s theorem, we find 


JJ (GV2U — UV?G) dv = — If] (GUk? — UGK?) dv = 0 


Thus the theorem reduces to 


Mes an us) as =0 
«  -[f(o8-v®)a-f(et-u8)a os 


Se 
Now note that for a general point P; on S’ we have 
ex Ik , 
G(P)) = ea © rot) 
ro. 


aG(P ki 
and CE EE et (3: uf ~) 2 (ikrex) (3-11) 


dn Yo. To. 


(3-10) 
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where cos (fi,fo1) represents the cosine of the angle between the outward 
normal f and the vector fo: joining Py to P;. For the particular case of P; 


on S,, cos (fi,fo1) = —1, and these equations become 
eis dG(P uaa 
G(P;) = — and Sane ( = jt) (3-12) 
€ on e \e 


Letting e become arbitrarily small, the continuity of U (and its derivatives) 
at P» allows us to write 


y 2S , | OU(Po) e* RET ; 
Hie an ue) as moe | on ad ies ok € (= -,«)| 


= —4nU(Py) 


0 


Substitution of this result in (3-9) yields 


Urry es aoe E wae) _y2 |= wee | ds (3-13) 


on To1 


This result is known as the integral theorem of Helmholtz and Kirchhoff; 
it plays a significant role in the development of the scalar theory of 
diffraction, for it allows the field at any point Po to be expressed in terms 
of the ‘boundary values” of the wave on any closed surface surrounding 
that point. As we shall now see, such a relation is instrumental in the 
development of scalar diffraction equations. 


3-3. THE KIRCHHOFF FORMULATION OF DIFFRACTION BY A PLANE SCREEN 


Consider now the problem of diffraction by an aperture in an infinite 
opaque screen. As illustrated in Fig. 3-4, a wave disturbance is assumed 
to impinge on the screen and aperture from the left, and the field at the 
point P» behind the aperture is to be calculated. 


Application of the integral theorem 


To find the field at the point Po, we apply the integral theorem of 
Helmholtz and Kirchhoff, being careful to choose a surface of integration 
that will allow the calculation to be performed successfully. Following 
Kirchhoff, the closed surface S is chosen to consist of two parts, as 
shown in Fig. 3-4. Let a plane surface, S., lying directly behind the 
diffracting screen, be joined and closed by a large spherical cap, So, of 
radius R and centered at the observation point Po. The total closed surface 
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Figure 3-4 Kirchhoff formulation of 
diffraction by a _ plane 
screen. 


S is simply the sum of 8; and S:. Thus, applying (3-13), 


1 aU aG 
U(Po) = 7 il (Rc -ue)a 


SitS2 
where, as before, 


exp (jkro1) 


To. 


G= 


As R increases, S2 approaches a large hemispherical shell. It is 
tempting to reason that, since both U and G will fall off as 1/R, the inte- 
grand will ultimately vanish yielding a contribution of zero from the 
surface integral over So. However, the area of integration increases as 
R*, so this argument is incomplete. It is also tempting to assume that, 
since the disturbances are propagating with finite velocity c, R will 
ultimately be so large that the waves have not yet reached S2, and the 
integrand will be zero on that surface. But this argument is incompatible 
with our assumption of monochromatic disturbances, which must (by 
definition) have existed for all time. Evidently a more careful investiga- 
tion is required before the contribution from S» can be disposed of. 

Examining this problem in more detail, we see that on S»2 


exp (jkR) 
Rk 


dG 1\exp GAR) _ . 
ees [eave Vs ae 
(; =) R ~jkG 


G= 
and, from (3-11), 
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where the last approximation is valid for large R. The integral in question 
can thus be reduced to 


[J |o= - UG) | ds = fc (= ~ jhU) Re ds 


where @ is the solid angle subtended by 8» at Po. Now the quantity |RG| 
is uniformly bounded on S:. Therefore, the entire integral over S» will 
vanish as & becomes infinite provided the disturbance U has the property 


a R(o jt) = 0 (3-14) 
uniformly in angle. This requirement is known as the Sommerfeld radia- 
tion condition [Ref. 3-13] and is satisfied if the disturbance U vanishes at 
least as fast as a diverging spherical wave (see Prob. 3-1). Since the dis- 
turbances that illuminate the aperture will invariably consist of a spher- 
ical wave or a linear combination of spherical waves, we can be confident 
that this requirement will be satisfied in practice, and therefore that the 
integral over S2 will yield a contribution of precisely zero. 


The Kirchhoff boundary conditions 


Having disposed of the integration over the surface S., it is now possible 
to express the disturbance at Py in terms of the disturbance and its 
normal derivative over the infinite plane S; immediately behind the 


screen, that is, 
1 au aG 
aes See S = | fe =f 
U(P.) is [f (Ss U my) as (3-15) 


The screen is opaque, except for the open aperture which will be 
denoted ». It therefore seems intuitively reasonable that the major con- 
tribution to the integral (3-15) arises from points of S; located within the 
aperture =, where we would expect the integrand to be largest. Kirchhoff 
accordingly adopted the following assumptions [Ref. 3-14]: 


1. Across the surface 2, the field distribution U and its derivative 
aU/dn are exactly the same as they would be in the absence of 
the screen. 

Over the portion of Si that lies in the geometrical shadow of 
the screen, the field distribution U and its derivative dU/dn are 
identically zero. 


wo 


These conditions are commonly known as the Kirchhoff boundary 
conditions. The first allows us to specify the disturbance incident on the 
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aperture by neglecting the presence of the screen. The second allows us 
to neglect all of the surface of integration except that portion lying 
directly within the aperture itself. Thus (3-15) is reduced to 


1 aU aG 
U(P.) = = [J (S Ceau ) ds (3-16) 


While the Kirchhoff boundary conditions simplify the result con- 
siderably, it is important to realize that neither can be exactly true. 
The presence of the screen will invariably perturb the fields on 2 to some 
extent, for along the rim of the aperture certain boundary conditions 
must be met that would not be required in the absence of the screen. In 
addition, the shadow behind the screen is never perfect, for fields will 
invariably extend behind the screen for a distance of several wavelengths. 
However, if the dimensions of the aperture are large compared with a 
wavelength, these fringing effects can be safely neglected,! and the two 
boundary conditions can be used to yield results that agree very well with 
experiment. 


The Fresnel-Kirchhoff diffraction formula and the Huygens-Fresnel principle 


A further simplification of the expression for U(P») is obtained by noting 
that the distance ro: from the aperture to the observation point is usually 
many optical wavelengths, and therefore, since k > 1/ro1, Eq. (3-11) 
becomes 


aG(P 1 ee 
( 1) Beas (f,01) (se - ) exp (jkro1) 
on T01 To1 
= jk cos (fi,F01) ped SRSLY (gkros) 
To. 


Substituting this approximation and the expression (3-7) for G in Eq. 
(3-16), we find 


1 exp (jkrox) [ aU j oi 
U(Po) = An I ro > — JK cos to) | ds (3-17) 


1 Or 


Now suppose that the aperture is illuminated by a single spherical 
wave, 
A x Ik i) 
UIP = exp (jkra1) 


To. 


' As we shall see, objections to the use of the Kirchhoff boundary conditions arise, 
not because of fringing effects, but rather because of certain internal inconsistencies. 
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Figure 3-5 Point-source illumination of a 
plane screen. 


arising from a point source at Ps, a distance ro, from Py (see Fig. 3-5). 
If ro: is many optical wavelengths long, then (3-17) can be directly 
reduced (see Prob. 3-2) to 


= a exp [7k(rer ae ro1)] cos (fi,fo1) — COS (fi,fo1) 
uiPe) = = ff 


5 | ds (3-18) 


T21%01 


This result, which applies only for an illumination consisting of a single 
point source, is commonly known as the Fresnel-Kirchhoff diffraction 
formula. 

Note that Eq. (3-18) is symmetrical with respect to the point source 
and the observation point. Thus a point source at Po will produce at P: 
the same effect that a point source of equal intensity placed at P» will 
produce at Po. This result is sometimes referred to as the reciprocity 
theorem of Helmholtz. 

Finally, we consider an interesting and useful interpretation of the 
diffraction formula (3-18). Let that equation be rewritten as follows: 


UPS i / (Py) SPD (3-19) 


1 


where © U’(P,) == 5 -2 


: E exp Pe) E (fi,fo1) — cos Ae!) (3 20) 
4) 


Yo. 


Now (3-19) may be interpreted as implying that the field at Po arises from 
an infinity of fictitious “secondary” point sources located within the 
aperture itself. The amplitude U’(P;) of the secondary source located at 
P, is evidently proportional to the amplitude [A exp (jkro1)]/rs1 of the 
wave incident at P;, but it differs from that amplitude in three respects. 
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First, the amplitude differs from the incident amplitude by a factor \—’. 
Second, that amplitude is also reduced by a so-called obliquity factor 
lé[cos (fi,fo1) — cos (fi,f21)] which is never greater than unity nor less 
than zero; in effect, there is a nonisotropic “directivity pattern” associated 
with each secondary source. Finally, the phase of the secondary source 
at P, leads the phase of the incident wave by 90°. 

These curious properties of the secondary sources are significant in a 
historical sense, for in the earlier work of Fresnel, the combination of 
Huygens’ envelope construction and Young’s principle of interference 
was found to yield accurate predictions of diffraction patterns only if 
properties such as these could be ascribed to the secondary sources. In 
order to obtain accurate results, Fresnel assumed that these properties 
were true. Kirchhoff’s mathematical development demonstrated that 
they were in fact natural consequences of the wave nature of light. 

Note that the above derivation of the so-called Huygens-Fresnel 
principle has been restricted to the case of an aperture illumination con- 
sisting of a single expanding spherical wave. However, we shall return 
to the Huygens-Fresnel principle shortly to demonstrate that it is indeed 
more general than our first examination has implied. 


3-4. THE RAYLEIGH-SOMMERFELD FORMULATION OF DIFFRACTION 
BY A PLANE SCREEN 


The Kirchhoff theory has been found experimentally to yield remarkably 
accurate results and is widely used in practice. However, there are certain 
internal inconsistencies in the theory which have motivated a search for 
more satisfactory mathematical developments. The difficulties of the 
Kirchhoff theory stem from the fact that boundary conditions must be 
imposed on both the field strength and its normal derivative. In particular, 
it is a well-known theorem of potential theory that if a two-dimensional 
potential function and its normal derivative vanish together along any 
finite curve segment, then that potential function must vanish over the 
entire plane. Similarly, if a solution of the three-dimensional wave equa- 
tion vanishes on any finite surface element, it must vanish in all space. 
Thus the two Kirchhoff boundary conditions together imply that the 
field is identically zero everywhere behind the aperture, a result which 
contradicts the known physical situation. A further indication of these 
inconsistencies is the fact that the Fresnel-Kirchhoff diffraction formula 
can be shown to fail to reproduce the assumed boundary conditions as 
the observation point approaches the screen or aperture. In view of these 
contradictions, it is indeed remarkable that the Kirchhoff theory yields 
such accurate results. 
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The inconsistencies of the Kirchhoff theory were removed by Som- 
merfeld, who eliminated the necessity of imposing boundary values on 
both the disturbance and its normal derivative simultaneously. This 
so-called Rayleigh-Sommerfeld theory is the subject of the present section. 
For a comparison of the Kirchhoff and Rayleigh-Sommerfeld theories, 
see Ref. 3-15.1 


Choice of alternative Green’s functions 


Consider again Eq. (3-15) for the observed field strength in terms of the 
incident field and its normal derivative across the entire screen: 


Wise al i (Sc - u=) ds (3-15) 


Suppose that the Green’s function G of the Kirchhoff theory were modified 
in such a way that, while the development leading to the above equation 
remains valid, in addition either G or dG/dn vanishes over the entire 
surface S,. In either case, the necessity of imposing simultaneous boundary 
conditions on U and dU/dn would be removed, and the inconsistencies of 
the Kirchhoff theory would be eliminated. 

Sommerfeld pointed out that Green’s functions with the required 
properties do indeed exist. Suppose that G is generated not only by a 
point source located at Po, but also by a second point source located at 
a position Py which is the mirror image of Py on the opposite side of the 
screen (see Fig. 3-6). Let the source at Py be of the same wavelength 
as the source at Po, and suppose that the two sources are oscillating with 
a 180° phase difference. The Green’s function is in this case given by 


exp (jkrox) $e. exp (jk7o1) 


To1 To1 


Ge(P}) = 


where 7 is the distance from Py) to Pi. The corresponding normal 
derivative of G_ is 


0G_ 
on 


il ) exp (jkro1) 


Yo1 


= cos (fi,fo1) (se = 


Yo. 


(3-22) 


= 1 \ exp (jkFo1) 
— cos (fi,fo1) ( jk — = 


To1 To1 

1 The fact that one theory is self-consistent while the other is not does not neces- 
sarily mean that the former is more accurate than the latter. The relative accuracy 
of the Kirchhoff and Rayleigh-Sommerfeld formulations is still a subject of active 
research, 
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Jey Po 


To Toi 


Figure 8-6 Rayleigh-Sommerfeld formulation 
of diffraction by a plane screen. 


Now for P; on S;, we have 


Yor = To1 " 
cos (fi,fo1) = — cos (fi,fo1) 
and therefore on that surface 
G_(P)) = 0 
ZP 1 ik 
aCe y (Py = 2 cos (i,fo1) (3: — \ee (Jkrox) (3-23) 
on To1 To1 


Thus the function G_ vanishes over the entire surface Sy. 
An alternative (and equally useful) Green’s function is found by 
allowing the two point sources to oscillate in phase, giving 


G,(P3) a exp (7kro1) i exp (ikF un) 


Yo. To1 


(3-24) 


It is readily shown (see Prob. 3-3) that the normal derivative of this 
function vanishes across the screen and aperture. 


The Rayleigh-Sommerfeld diffraction formula 


Let the Green’s function G_ be substituted for G in Eq. (3-15). Using 
(3-23), it follows directly that 


U(P.) = J U(P:) mp cos (,fo1) ds (3-25) 
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where it has been assumed that 79; >> \. The Kirchhoff boundary condi- 
tions may now be applied to U alone, yielding the very general result 


1 1k 01 
U(Ps) = 5 i fuer ap cos (fi,F91) ds (3-26) 


Since no boundary conditions need be applied to dU/dn, the inconsist- 
encies of the Kirchhoff theory have evidently been removed. 

For the purpose of comparing these results with the predictions of the 
Kirchhoff theory, let the aperture be illuminated by a spherical wave 
diverging from a point source at position P», (see Fig. 3-5 again). Thus 


exp (Jkro1) 


Tor 
_ A exp [gk (121 + 101)] Ae 
and UP) = ff ror 08 ta) ds (8-27) 


This latter expression is known as the Rayleigh-Sommerfeld diffraction 
formula, and it should be compared with the corresponding result of the 
Kirchhoff theory, Eq. (3-18). Note that the only difference between the 
two expressions occurs in the obliquity factor. 

Consideration of the corresponding results when the Green’s function 
G., is chosen is deferred to the problems. The expression for U(Po) 
derived using this alternative Green’s function is, of course, just as valid 
as that derived above. 


Further discussion of the Huygens-Fresnel principle 


Our previous derivation of the Huygens-Fresnel principle assumed that 
the diffracting aperture is illuminated by a single expanding spherical 
wave. From the expression (3-26) it is clear that the principle actually 
holds for more general aperture illuminations. This generality is really 
implied by the more specific case, for an arbitrary illumination can always 
be decomposed into a (possibly infinite) collection of point sources, and 
the linearity of the wave equation can then be invoked to apply the 
principle to each point source individually. 

For our purposes here it is important to realize that the Huygens- 
Fresnel principle, as expressed by Eq. (3-26), is in reality no more than a 
superposition integral of the type discussed in Chapter 2. To emphasize 
this point of view, we may rewrite (3-26) as 


U(P.) = ff h(Po,P)UPy ds (3-28) 
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where the weighting function h(Po,P1) is given explicitly by 


eb exp (jkro 1) 


Pie 
h(Po,P1) on 


cos (fi,fo1) (3-29) 
The occurrence of a superposition integral as a result of our diffraction 
analysis should not be a complete surprise. The primary ingredient 
required for such a result was previously seen to be linearity, a property 
that was assumed early in our analysis here. 


3-5 GENERALIZATION OF THE RAYLEIGH-SOMMERFELD THEORY TO 
NONMONOCHROMATIC WAVES 


The wave disturbances have previously been assumed ideally mono- 
chromatic in all cases. Such waves can be closely approximated in practice, 
and are particularly easy to analyze. However, the more general case of 
a nonmonochromatic disturbance will now be considered briefly; attention 
is restricted to the predictions of the Rayleigh-Sommerfeld formulation, 
but similar results can be obtained from the Kirchhoff analysis. 

Consider the scalar disturbance u(Po,t) observed behind an aperture 
> in an opaque screen when a disturbance u(P:,t) is incident on that 
aperture. The time functions u(Po,t) and wu(P1,t) may be expressed as 
inverse Fourier transforms: 


+ 
MEAs ile. U(P1,v) exp (j2nvt) dy (3-30a) 


AP pty = ioe Uso) exp Genes (3-300) 


where U(Po,v) and U(P,,v) are simply the Fourier spectra of u(Po,t) and 
u(P,,t), respectively, and v represents optical frequency. 

Let Eqs. (8-30a,b) be transformed by the change of variables 
py’ = —y, yielding 


+ 

iu Piiae ics U(Pi, — »’) exp (—j2mv’t) do’ (3-31a) 
+ 20 

ul Pet = ie U (Bakes nie ne (3-31b) 


Now these relations may be regarded as expressing the nonmonochromatic 
time functions u(Pi,t) and u(Po,t) as a linear combination of mono- 
chromatic time functions of the type represented by Eq. (3-2). The mono- 
chromatic elementary functions are of various frequencies »’, the complex 
amplitudes of the disturbance at frequency »’ being simply U(Pi,—»’) 
and U(Po,—»’). By invoking the linearity of the wave-propagation 
phenomenon, we shall use the results of Sec. 3-4 to find the complex 
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amplitude of each monochromatic disturbance at Po, and superimpose 
the results to yield the general time function u(P,,t). 
To proceed, Eq. (3-26) can be directly used to write 


exp (200 a) 
: ' c ae 
UP Ge) =] — i CE) 0s (iy) as (e802) 
cs ro. 
where the relation \’»’ = c has been noted. Substitution of (3-32) in 
(3-31b) and an interchange of orders of integration give 


u(P,t) 
Cos (fi,Fo1) i 
- =e cos (fi,fo1) 01 ies | 72m U(P;, — 9’) exp | 2m" (: — = dv' ds 
C 


2acr 01 
Finally, the identity 


d d ft 
i Cr ails Fel ee U(Pi,—»’) exp (—j2zmv’t) dv’ 


+2 : 
= ie —j2rv'U(P1,—v’) exp (—j2mv't) dv’ 


can be used to write 


u(Pot) = J | ee ebay “ “u(P, -") ds (3-33) 


2Qrcr 01 


Thus the wave disturbance at point Po is seen to be linearly propor- 
tional to the time derivative of the disturbance at each point P; on the 
aperture. Since it takes time 7o1/c for the disturbance to propagate from 
P; to Po, the observed wave depends on the derivative of the incident 
wave at the “retarded” times t — (101/c). 

This more general treatment shows that the study of diffraction of 
perfectly monochromatic waves is in no way an academic subject, for 
the results for more general time disturbances are readily synthesized 
from the monochromatic results. In addition, optical sources that may be 
regarded for most purposes as ideally monochromatic are readily available 
(e.g., the helium-neon gas laser). For further consideration of nonmono- 
chromatic waves see Prob. 3-5. 


3-6 DIFFRACTION AT BOUNDARIES 


In the statement of the Huygens-Fresnel principle, we found it convenient 
to regard each point on the aperture as a new source of spherical waves. 
It was pointed out that such sources are merely mathematical conven- 
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iences and have no real physical significance. A more physical point of 
view, first expressed qualitatively by Thomas Young in 1802, is to regard 
the observed field as consisting of a superposition of the incident wave 
transmitted through the aperture unperturbed, and a diffracted wave 
originating at the rim of the aperture. The possibility of a new wave 
originating in the material medium of the rim makes this interpretation 
a more physical one. 

Young’s qualitative arguments were given added impetus by Som- 
merfeld’s rigorous solution to the problem of diffraction by a semi- 
infinite, perfectly conducting reflecting screen [Ref. 3-7]. This rigorous 
solution showed that the field in the geometrical shadow of the screen 
has the form of a cylindrical wave originating on the rim of the screen; 
while in the illuminated region behind the screen, the field is the super- 
position of this cylindrical wave with the directly transmitted incident 
wave. 

The applicability of a boundary-diffraction approach in more 
general diffraction problems was investigated by Maggi [Ref. 3-16] and 
Rubinowicz [Ref. 3-17], who showed that the Kirchhoff diffraction formula 
can indeed be manipulated to yield a form that is equivalent to Young’s 
ideas. More recently, Miyamoto and Wolf [Ref. 3-18] have extended the 
theory of boundary diffraction. For further discussion of these ideas the 
reader should consult the references cited. 

Another approach that is closely related to Young’s ideas is the 
geometrical theory of diffraction developed by Keller [Ref. 3-19]. In 
this treatment, the field behind a diffracting obstacle is found by the 
principles of geometrical optics, modified by the inclusion of ‘‘diffracted 
rays’ that originate at certain points on the obstacle itself. New rays are 
assumed to be generated at edges, corners, tips, and surfaces of the 
obstacle. This theory can often be applied to calculate the fields diffracted 
by obstacles that are too complex to be treated by other methods, and is 
therefore enjoying an increasing level of popularity. 


3-7 THE ANGULAR SPECTRUM OF PLANE WAVES 


It is also possible to formulate scalar diffraction theory in a framework 
that closely resembles the theory of linear time-invariant filters. While in 
physical optics this treatment is less widely used than the Kirchhoff 
theory, nonetheless its intuitive appeal to readers grounded in linear Sys- 
tems theory makes its inclusion here well worthwhile. As we shall see, 
if the complex field distribution across any plane is Fourier analyzed, 
the various spatial Fourier components can be identified as plane waves 
traveling in different directions. The field amplitude at any other point 
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can be calculated by adding the contributions of these plane waves, tak- 
ing due account of the phase shifts they have undergone in propagating 
to the point in question. For a detailed treatment of this approach to 
diffraction theory, as well as its application in the theory of radio-wave 
propagation, the reader is referred to the work of Ratcliffe [Ref. 3-20]. 


The angular spectrum and its physical interpretation 


Suppose that, due to some unspecified system of monochromatic sources, 
a wave is incident on the zy plane of Fig. 3-7, traveling in the positive 
z direction. Let the complex field across that plane be represented by 
U(z,y,0); our ultimate objective is to calculate the consequent field 
U(x,y,z) that appears at a second point Po with coordinates (x,y,z). 

Across the zy plane, the function U has a two-dimensional Fourier 
transform given by 


A(fxsfv) = ff U(e,y,0) exp [—j2e(fxx + fry] dx dy (3-84) 


As pointed out in Chapter 2, the operation of a Fourier transformation 
may be regarded as a decomposition of a complicated function into a 
collection of more simple complex-exponential functions. To emphasize 
this point of view, we write U as an inverse transform of its spectrum 


U(x,y,0) = lf Ao(fx,fy) exp [j2a(fxx SF fry)] dfx dfy (3-35) 


Now recall that the equation for a unit-amplitude plane wave propagating 
with direction cosines (a,8,y) is simply 
20 
B(x,y,z) = exp | J (aw + By + ve) 
where y=V1- a’ —- #? 


Thus across the plane z= 0, a complex-exponential function exp 
[j2r(fxx + fry)] may be regarded as a plane wave propagating with 
direction cosines 


a=AMx B=My y= V1— (fx)? — (My)? (3-36) 


The complex amplitude of that plane-wave component is simply 
A.(fx,fy) dfx dfy, evaluated at (fx = a/d, fy = B/d). For this reason, 
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the function 


a B ji 3 a B 
—-)= — = — dx d 3-37 
A. (Z :) i] U(z,y,0) exp | jon (<: ss ey) tdy (3-37) 
is called the angular spectrum of the disturbance U(z,y,0). 


Propagation of the angular spectrum 


Consider now the angular spectrum of the disturbance U across a plane 
parallel to the zy plane but at a distance z from it. Let the function 
A(a/X, B/d;2) represent the angular spectrum of U(z,y,z), that is, 


A (2.5 e 2) = I U(z,y,z) exp | -i2s (: x + : »)| dx dy (3-38) 


Now if the relation between A,(a/\, B/A) and A(a/d, B/d3z) can be 
found, then the effects of wave propagation on the angular spectrum of 
the disturbance will be evident. 

To find the desired relation, note that U can be written 


UGueE [fa (<2; 2) exp Jie (22 +0) |aeac (3-39) 


In addition, U must satisfy the Helmholtz equation 
VU + k?U = 0 


at all source-free points. Direct application of this requirement to Eq. 
(3-39) shows that A must satisfy the differential equation 


d? a Bo Qa a Bp = 
Gee lace) bee ot Ge 2) =0 


An elementary solution of this equation can be written in the form 


A (< ef :) = /N; (: *) exp (; i V1—- a — p :) (3-40) 


Re mn 
This result demonstrates that when the direction cosines (a,8) satisfy 
ar PB E 


the effect of propagation over a distance z is simply a change in the relative 
phases of the various components of the angular spectrum. Since each 
plane-wave component propagates at a different angle, each travels a 
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different distance to reach a given observation point and relative phase 
delays are thus introduced. 
However, when (a,8) satisfy 


Ce as tee 


a different interpretation is required. In such a case, the square root in 
Eq. (3-40) is imaginary, and that equation can be written 


A (< =, :) = A; (<. *) exp (— uz) (3-41) 


2a 
where mes ee Agate 


Since yp is a positive real number, these wave components are strongly 
attenuated by the propagation phenomenon. Such components of the 
angular spectrum are called evanescent waves and are quite analogous 
to the waves produced in a microwave waveguide that is driven below 
its cutoff frequency. Note that the limiting case 


a? + p? = 1 


corresponds to plane waves that are traveling in directions normal to 
the z axis, contributing no net power flow in the z direction.! 

Finally, we note that the disturbance observed at (z,y,z) can be 
written in terms of the initial angular spectrum by inverse-transforming 
Eq. (3-40), giving 


vena) = ff (22) ex (5% Vi= = #2) 


Pea eels ey 
exp |i2e(Z2 +£x)] ae ae (3-42) 


Effects of a diffracting aperture on the angular spectrum of a disturbance 


Suppose that an infinite opaque screen containing an aperture 2 is 
introduced in the plane z = 0 of Fig. 3-7. We now consider the effects 
of that diffracting screen on the angular spectrum of the disturbance. 
Let the angular spectrum of the disturbance incident on the screen be 


1 Note that evanescent waves are predicted only under the very same conditions 
for which the use of a scalar theory is suspect. They therefore are more properly 
treated in a vectorial approach. 
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Directions of 
wave <— 
propagation 


y 


Figure 3-7 Coordinate system for angular spectrum 
calculations. 


A.(a/d, B/d), and define the transmittance function of the aperture by 


eyes (x,y) in 2 
it,y) = 0 otherwise 


We apply the Kirchhoff boundary conditions to the disturbance U, assum- 
ing that the screen does not disturb the waves incident on 2 and that the 
fields in the geometrical shadow of the screen are identically zero. Thus 
the fields across a plane immediately behind the screen may be written 


U,(x,y,0) a U:(x,y,0)t(x,y) 


where U,(x,y,0) represents the fields incident on the screen. The convolu- 
tion theorem of Fourier analysis can now be used to write the angular 
spectrum A,(a/\, 8/d) of the transmitted waves as 


a B\_, (2 8\ (28 ‘ 
a (% =) = a.(% | (2 :) (3-43) 


B " B 
where T (¢ =) ZS JJ t(x,y) exp | = (c r+ - )| dx dy (3-44) 


Thus the angular spectrum of the transmitted disturbance is found by 
convolving that of the incident disturbance with a second angular 
spectrum which is characteristic of the aperture > itself. 

For the special case of a single, unit-amplitude plane wave illuminat- 
ing the aperture normally, the result takes a particularly simple form. 
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In that case 


and A; (<. e) = (= Sat (% *) = r(% ‘) 
A Xr ON AXA AX 


Thus the transmitted angular spectrum is found directly by Fourier 
transforming the transmission function of the aperture. 

To summarize, the introduction of a diffracting aperture which 
spatially limits an incident wave has the effect of broadening the angular 
spectrum of the disturbance. When the aperture is illuminated by a 
plane wave, the angular spectrum of the transmitted light can be found 
by directly Fourier transforming the transmission function of the aperture. 
The smaller the aperture, the broader the angular spectrum behind the 
aperture. An analogous effect occurs, of course, in the pulsing of electrical 
signals; a limitation of the temporal duration of a function has the effect 
of broadening its frequency spectrum. 


The propagation phenomenon as a linear spatial filter 


Consider again the propagation of light from the plane z = 0 of Fig. 3-7 
to a parallel plane at distance z. The disturbance U(z,y,0) incident on 
the first plane may be considered to be mapped by the propagation 
phenomenon into a new field distribution U(z,y,z). Such a mapping 
satisfies our previous definition of a system. We shall, in fact, demonstrate 
that the propagation phenomenon acts as a linear space-invariant 
system, and that it is characterized by a relatively simple transfer 
function. 

The linearity of the propagation phenomenon has already been 
discussed; it is directly implied by the linearity of the wave equation, or 
alternatively, by the superposition integral (3-28). The space-invariance 
property is most easily demonstrated by actually deriving a transfer 
function that describes the effects of propagation; if the mapping has a 
transfer function, then it must be space-invariant. 

To find the transfer function, we return to the angular spectrum 
point of view. However, rather than writing the angular spectra as func- 
tions of the direction cosines (a,8), it is now more convenient to leave 
the spectra as functions of the spatial frequencies (fx, fy). The spatial 
frequencies and the direction cosines are directly related through Eq. 
(3-36). 

Let the spatial spectrum of U(z,y,z) again be represented by 
A(fx,fy jz), while the spectrum of U(z,y,0) is again written A,(fx,fr)- 
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Thus we may express U(z,y,z) as 
U(e,y2) = ff AGxfriz) exp [j2e(fxa + fry)] dfx afy 


But in addition, from Eq. (3-42) we have 


Une) = [f AUisdy) ox | 22 VI Oi? = Of*| 


a oth ee ee ee 


A comparison of these two equations shows that 


A(fx,fy32z) = Ad(fx,fy) exp Ex V1 — (Afx)? — One| (3-45) 


and the propagation phenomenon is seen to be characterized by a transfer 
function H given by 


A(fx, fr; : 5 ; 
H(fx,fy) = a, = exp Ex V1 — (fx)? — fy) | (3-46) 


If the distance z is at least several wavelengths long, then the evanescent 
waves may be neglected, yielding a transfer function 


(Ravan fe: 1 
lee ay [Pes Med Sel a on | fx thet <5 


0 otherwise (3-47) 


In conclusion, then, the propagation phenomenon may be regarded 
as a linear dispersive spatial filter with a finite spatial bandwidth. The 
transmission of the filter is zero outside a circular region of radius \~! in 
the frequency plane. Within that circular bandwidth, the modulus of 
the transfer function is unity but frequency-dependent phase shifts are 
introduced. The phase dispersion of the system is most significant at 
high spatial frequencies and vanishes as fy and fy approach zero. 


PROBLEMS 


3-1 Show that a diverging spherical wave satisfies the Sommerfeld radiation 
condition. 


3-2. Show that Eq. (3-17) reduces to Eq. (3-18) when the aperture illumination 
consists of a diverging spherical wave. 
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3-3 Consider the use of the Green’s function 
“ ike = A 
GPa exp (jkro1) syeee (gh Fo1) 
To. To. 
in the Rayleigh-Sommerfeld theory. 
(a) Show that the normal derivative of G, vanishes across the plane of the 
aperture. 
(b) Using this Green’s function, find an expression for U(P») in terms of an arbi- 
trary disturbance across the aperture. What boundary conditions must be applied 
to obtain this result? 
(c) Using the result of (b), find an expression for U(P,) when the aperture 
illumination consists of a spherical wave diverging about the point P». 
3-4 Assuming unit-amplitude normally incident plane-wave illumination, find the 
angular spectrum of 
(a) a circular aperture of diameter d 
(b) a circular opaque disk of diameter d 
3-5 Consider a nonmonochromatic disturbance u(P,ft) of center frequency > and 
bandwidth Avy. Let a related complex-valued disturbance u_(P,t) be defined as 
consisting of only the negative-frequency components of u(P,t). Thus 
0 
ee ie GPW) can tion) ay 
where U(P,v) is the Fourier spectrum of u(P,t). Assuming the geometry of Fig. 
3-4, show that if 
A 1 
ie, th SS 
v Av Cc 
1 exp (jkro1) aie 
then v(Po,.). == If u_(P,,t) pee cos (f1,f01) ds 
jx To. 
b> 
where X = c/p and k = 27/X. 
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4 FRESNEL AND FRAUNHOFER 
DIFFRACTION 


In the preceding chapter the results of scalar diffraction theory were 
presented in their most general forms. Attention is now turned to certain 
approximations to the general theory—approximations that will allow 
diffraction-pattern calculations to be reduced to more simple mathe- 
matical manipulations. These approximations, which are commonly made 
in many fields that deal with wave propagation, will be referred to as the 
Fresnel and Fraunhofer approximations. Our treatment will differ from 
more common treatments of Fresnel and Fraunhofer diffraction in that, 
in accordance with our view of the propagation phenomenon as a “‘sys- 
tem,” we shall attempt to find approximations that are valid for any of a 
wide class of “input” field distributions. 


4-1 APPROXIMATIONS TO THE HUYGENS-FRESNEL PRINCIPLE 
Consider again diffraction of monochromatic light by a finite aperture 


> in an infinite opaque screen. As indicated in Fig. 4-1, the screen is 


Yi Aperture Yo 


ry 


—— 2 
—— 


ee 
ray=Ve +(ao— %) sh Ne 


Observation 
region 


Figure 4-1 Diffraction geometry. 
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assumed to be planar, with a rectangular coordinate system (21,¥1) 
attached. In addition, the region of observation is assumed to be a plane, 
standing parallel with the plane of the screen at a normal distance z. 
A coordinate system (2o,yo) is attached to the plane of observation, with 
the coordinate axes parallel with those in the z1y; plane. 


Initial approximations 


Using the mathematical expression of the Huygens-Fresnel principle, 
in particular Eqs. (3-28) and (3-29), the field amplitude at point (xo,yo) 
is readily written as 


U(x0,yo) = ff h(x0,Yo3t1,y1)U(a1,y1) dri dy1 (4-1) 
z 
1 kk 
where h(xo0,Yojt1y1) = = exp (Jhrox) cos (fi,fo1) (4-2) 


jy Tou 


In the future, this superposition integral will be written with infinite 
limits, it being understood that, in accordance with the Kirchhoff 
boundary conditions, U(x1,y1) is identically zero outside the aperture 2. 
Thus 


U(x0,yo) = il h(xo0,Yo321,y1)U (a1, y1) dri dy1 (4-3) 


Our approximations will be based on the assumption that the dis- 
tance z between aperture and observation plane is very much greater than 
the maximum linear dimension of the aperture L. In addition, we shall 
assume that in the plane of observation only a finite region about the 
z axis is of interest, and that the distance z is much greater than the 
maximum linear dimension of this region. With these assumptions, the 
obliquity factor is readily approximated by 


cos (fi,fo1) = 1 


where the accuracy is to within 5 percent if the angle (f,f91:) does not 
exceed 18°. Under similar conditions the quantity 79; in the denominator 
of (4-2) will not differ significantly from z, allowing the weighting function 
to be approximated as 


1 
h(xo,Yojt1,Y1) = — exp (jkroi) (4-4) 
rz 


Note that the quantity ro in the exponent cannot be replaced simply by 
z, for the resulting errors will be multiplied by a very large number k, 
and consequently can generate phase errors much greater than 27 radians. 
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The Fresnel approximations 


Further simplification can be accomplished only by adopting certain 
approximations to the quantity ro: in the exponent of Eq. (4-4). From 
Fig. 4-1, the distance roi is given exactly by 


Lie = V2 + (% — 21)? + (Ye — Yi)” 


=e yi a (B=) a6 (Hany (4-5) 


A convenient means of approximation is offered by a binomial expansion 
of the square root, 


——— 1 1 
ee lb] <1 (4-6) 


We therefore asswme that the square root in Eq. (4-5) is adequately 
approximated by the first two terms of its expansion, giving 


Lo — t1\? — y;:\? Le/— 3) 29.015 Ue, AN: 
x h+ (2) +(4—*) ~2[1+3(2—*) +3(4—4)) 
z zZ 2 Zz 2 z 
This assumption, which will be referred to as the Fresnel approximation, 
allows the weighting function to be rewritten as 


exp (jkz) , 


h(xo,Yo3%1,Y1) = dz 


xp {is leo — 2)? + evo] 2 
When the distance z is sufficiently large for this approximation to be an 
accurate one, the observer is said to be in the region of Fresnel diffraction. 

Note that the critical part of our approximation has been the replace- 
ment of the spherical Huygens’ wavelets by quadratic surfaces. Clearly, 
the accuracy of such an approximation requires certain bounds on the 
relative sizes of the aperture, the observation region, and the distance z. 
Asa sufficient condition for accuracy, we might require that the maximum 
phase change contributed by the next-higher-order term in the binomial 
expansion be much less than 1 radian. This condition will be met if the 
distance z satisfies 


23 >> — [(to — t1)? + (Yo — Y1)"Iinax (4-8) 


Tv 
4r 
However, this requirement is in general not a necessary one. For the 
Fresnel approximation to remain valid, it is only required that the 
higher-order terms of the expansion not change the value of the super- 
position integral (4-3); the maximum values of the added phase factors 
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need not be much less than 1 radian for this to be true. For distances z 
small enough to violate (4-8), the quantity k/2z will generally be so 
large, and the oscillations of the quadratic phase factor in Eq. (4-7) so 
rapid, that the primary contribution to the integral will arise only from 
points near (%1 = to, Y1 = Yo), Where the rate of change of phase is 
minimum. In the neighborhood of such points of ‘stationary phase,”’ the 
magnitude of the next-higher-order phase term is often entirely negligible. 
For further discussion of this so-called principle of stationary phase see 
Appendix III of Ref. 3-4. 

Accepting the validity of the Fresnel approximation, the superposi- 
tion integral can be expressed in either of two equivalent forms. First, 
U(xo,yo) may be regarded as a convolution of U(r1,y1) with h; that is, 


U(xo,Yo) 
= ead if U(x1,y1) exp lis. Se are Gee | Presb, 
(4-9) 


Alternatively, the quadratic terms in the exponent may be expanded to 
yield 


F : 
(eas) = FEE exp [55 Geot + uo | [f {UEoud 


k Dy 
exp Ee Cro w|| exp |-i= (tort1 + vw) | dx, dy, (4-10) 


Thus, aside from multiplicative amplitude and phase factors that are 
independent of (x1,y1), the function U(xo,yo) may be found from a Fourier 
transform of U(x1,y1) exp [7(k/2z) (a1? + y1?)], where the transform must 
be evaluated at frequencies (fx = xo/Az, fy = yo/Az) to assure the correct 
space scaling in the observation plane. 

Returning to the first form, (4-9), the convolution nature of that 
equation immediately suggests that perhaps some additional insight can 
be gained by examining the effects of Fresnel diffraction in the spatial 
frequency domain. Accordingly, the space-invariant weighting function 
(4-7) can be directly Fourier transformed to yield a transfer function 


H(fx,fy) = exp (jkz) exp [—jrdz(fx? + fr’)] (4-11) 


describing the effects of propagation in the region of Fresnel diffraction. 
The first exponential factor in this expression represents an overall phase 
retardation suffered by any component of the angular spectrum as it 
propagates between the two planes separated by distance z. The second 
exponential factor represents a phase dispersion with a quadratic fre- 
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quency dependence. Comparison of Eqs. (4-11) and (3-46) shows that the 
above transfer function is simply an approximation to the more general 
transfer function derived in Sec. 3-6. 


The Fraunhofer approximation 


Diffraction-pattern calculations can be further simplified if restrictions 
more stringent than those used in the Fresnel approximations are adopted. 
In particular, it was seen earlier that in the region of Fresnel diffraction 
the observed field strength U(xo,yo) can be found from a Fourier transform 
of the product of the aperture distribution U(x1,y:) with a quadratic 
phase function exp [j(k/2z)(x1? + yi”)]. If in addition the stronger 
(Fraunhofer) assumption 
ey Ber Ua see 


z2> : (4-12) 


is adopted, then the quadratic phase factor is approximately unity over 
the entire aperture, and the observed field distribution can be found 
directly from a Fourier transform of the aperture distribution itself. 
Thus in the region of Fraunhofer diffraction, 


_k 
exp (jkz) exp E as Cay = w)| 
prez 


7 2 
Ml U(x1,y1) exp | =i = (xot1 + my) | dx, dy; (4-13) 


U(xo,yYo) = 


Aside from the multiplicative factors preceding the integral, this expres- 
sion is simply the Fourier transform of the aperture distribution, evaluated 
at frequencies (fx = Xo/dz, fy = Yo/dz). 

At optical frequencies, the conditions required for the validity of the 
Fraunhofer diffraction equation can indeed be severe ones. For example, 
at a wavelength of 6 X 10-7 meter (red light) and an aperture width of 
2.5 cm (1 inch), the observation distance z must satisfy 


z > 1,600 meters 


Nonetheless, the required conditions are met in a number of important 
problems. In addition, Fraunhofer diffraction patterns can be observed 
at distances closer than implied by (4-12), provided the aperture is 
illuminated by a spherical wave converging toward the observer (see 
Prob. 4-3), or if a converging lens is properly situated between the observer 
and the aperture (see Chapter 5). 

Finally, it should be noted that there exists no transfer function that 
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can be associated uniquely with Fraunhofer diffraction, for the approxima- 
tion (4-12) has destroyed the space invariance of the diffraction equation 
(cf. Prob. 2-10). However, since Fraunhofer diffraction is simply a limiting 
case of Fresnel diffraction, the transfer function (4-11) remains valid 
throughout both the Fresnel and the Fraunhofer regimes. 


4-2. EXAMPLES OF FRAUNHOFER DIFFRACTION PATTERNS 


We consider next a number of examples of Fraunhofer diffraction cal- 
culations. For additional examples the reader may consult the problems 
(see Probs. 4-1, 4-2, and 4-3). 

The results of the preceding section can be applied directly to find 
the complex field distribution across the Fraunhofer diffraction pattern 
of any given aperture. However, it should be noted that real radiation 
detectors, including the eye, respond to optical intensity! rather than 
field amplitude. The final description of the diffraction patterns will 
therefore be a distribution of intensity. 


Rectangular aperture 


Consider first a rectangular aperture with an amplitude transmittance 


given by 
x n 
t(r1,y1) = rect (2) rect (“*) 
lx ly 


The constants lx and ly are simply the respective widths of the aperture 
in the x; and y; directions. If the aperture is normally illuminated by a 
unit-amplitude, monochromatic plane wave, then the field distribution 
across the aperture is equal to the transmittance function ¢. Thus using 
Kq. (4-13), the Fraunhofer diffraction pattern of the aperture is seen to be 


exp (jkz) exp E os (Zo? + w’)| 


U(xo,yo) = F{U(x1,y1) } 


fx =x0/dz 
fy =yo/dz 


rz 
Noting that F{U(x1,y1) } = lxly sine(/yfx) sine(lyfy) 
we find 


Kk 
exp (jkz) exp E ap eth w| 
< lxly sine (22) sine lyYo 
rz XY Ve o 


' The intensity of an optical field is defined here by J(P) = 2(u?(P,t)), where the 
angle brackets indicate an infinite time average. For the case of a monochromatic 
field of complex amplitude U(P), this expression reduces to I(P) = |u(P)|?. 


U(x, yo) a 
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Figure 4-2 Cross section of the Fraunhofer diffraction pattern of a rectangular 
aperture. 


lx’ly? . lxao\ . ly: 
and T(x0,Yo) = a sinc? (22) sinc? (=) (4-14) 


Figure 4-2 shows a cross section of the Fraunhofer diffraction pattern 
along the x» axis. Note that the width between the first two zeros (the 
width of the main lobe) is 


r 
ae yee (4-15) 
lx 
Figure 4-3 shows a photograph of the diffraction pattern produced by a 
rectangular aperture with a width ratio of lx/ly = 2. 


Circular aperture 


Consider a diffracting aperture that is circular rather than rectangular, 
and let the diameter of the aperture be /. Thus, if 7; is a radius coordinate 
in the plane of the aperture, then 


t(r1) = cire (#) 
1/2 


The circular symmetry of the problem suggests that the Fourier trans- 
form in Eq. (4-13) be rewritten as a Fourier-Bessel transform. Thus if ro 
is the radius coordinate in the observation plane, we have 
exp (jkz) kro? 
Urs) = —. _— exp (; — 
rz 


i ) @{U(r;)} (4-16) 


22 p=ro/rz 
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Figure 4-8 The Fraunhofer diffraction pattern of a rectangular 
aperture (lx/ly = 2). 


For unit-amplitude, normally incident plane-wave illumination, we have 
U(r1) = t(ri); in addition, 


(ire (28)] 2 (2) neo) 
Sd ae NIP) Ale WOy Lala iy: 
Thus the amplitude distribution in the Fraunhofer diffraction pattern is 
seen to be 
exp (ke) ae (i=) (5) ud 
U(r) qrzZ Ne 22 2, lro/2dz 


TOA THEE GS) kl? | id 
exp (jkz) exp \ J Sze gas |e ie 


| 


and the intensity distribution can be written 


I(r) = (=) |2 ee al (4-17) 


8z klro/2z 


This intensity distribution is generally referred to as the Airy pattern, 
after G. B. Airy who first derived it. Table 4-1 shows the values of the 
Airy pattern at successive maxima and minima, from which it can be 
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Table 4-1 

: [22] a 
ae min 

0 1 max 
1220 0 min 
L689 0.0175 max 
Pe PS 0 min 
2.679 0.0042 max 
3.238 0 min 
3.699 0.0016 max 


Figure 4-4 Cross section of the Airy pattern. 


seen that the radius to the first zero is given by 


r 
Arye 1.22 = (4-18) 


Figure 4-4 shows a cross section of the Airy pattern, while Fig. 4-5 is a 
photograph of the Fraunhofer diffraction pattern of a circular aperture. 


Sinusoidal amplitude grating 


In the previous examples, diffraction was assumed to be caused by 
apertures in infinite opaque screens. We now generalize our concept of a 
diffracting object. 
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Figure 4-5 Fraunhofer diffraction pattern of a circular 
aperture. 


Let the complex amplitude transmittance t(x,y) of a screen be defined 
as the point-by-point ratio of field amplitude immediately behind the 
screen to field amplitude incident on the screen. Until now, only trans- 
mittance functions of the form 


ony 1 in the aperture 
Y’ =~ \0 outside the aperture 


have been considered. It is possible, however, to introduce a prescribed 
amplitude transmittance function within a given aperture. Spatial pat- 
terns of absorption can be introduced with, for example, a photographic 
transparency, thus allowing all real values of t between zero and one to be 
realized. Spatial patterns of phase shift can be introduced by means of 
transparent plates of varying thickness, thus extending the realizable 
values of t to all points within or on the unit circle in the complex plane. 

As an example of this more general type of diffracting screen, con- 
sider a stnusoidal amplitude grating defined by the transmittance function 


t(z,y) = E + 5 008 nf) | rect (5) rect (“) (4-19) 


FRESNEL AND FRAUNHOFER DIFFRACTION / 67 


where for simplicity we have assumed that the grating structure is 
bounded by a square aperture of width J. The parameter m represents the 
peak-to-peak change of amplitude transmittance across the screen, and 
fo is the grating frequency. Figure 4-6 shows a cross section of this trans- 
mittance function. 

If the screen is normally illuminated by a monochromatic, unit- 
amplitude plane wave, the field distribution across the aperture is equal 
simply to t. To find the Fraunhofer diffraction pattern, we first Fourier 
transform that field distribution. Noting that 


2 


and = [rect (*) rect (*)| = /? sinc(/fx) sine(lfy) 


1 
33 F am m cos Crfur)| me 8(fx,fy) + 7 (fx + fof) + 7 fx — fo, fy) 


the convolution theorem can be used to write 
ae ; soe 
${U(a1,y1)} = a sinc (Ify) {sine(Ufx) _ 5 sinc[l(fx + fo)] 
+ F sinell(fx — fo} 
The Fraunhofer diffraction pattern can now be written 
[? ‘ wale . ly . lx 
U(x0,yo) = Pre exp (jkz) exp E = (ao? + w)| sinc (‘““) {sine (“ 


sine E (to + for) | a 5 sine E (xo — je) |} (4-20) 


m 
a 


> 


Figure 4-6 Transmittance function of a sinusoidal amplitude grating. 
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Figure 4-7 Cross section of the Fraunhofer diffraction pattern of a sinusoidal 
amplitude grating. 


Finally, the corresponding intensity is found by squaring (4-20). 
Note that if the grating frequency fo) is much greater than 2/1, there will 
be negligible overlap of the three sine functions that depend on xo, giving 


2 2 
5. Yo) = Ea sinc? (*) {sine (2) 
iis fey Le | +"Fsine |= os || : 
+ A sine |= (xo + fodz) | + q Sine - (xo — fodz) (4-21) 


This intensity pattern is illustrated in Fig. 4-7. Note that the sinusoidal 
transmission variation across the aperture has deflected some of the 
energy out of the central diffraction pattern into two additional side- 
patterns. The central diffraction pattern is called the zero-order com- 
ponent of the Fraunhofer pattern, while the two side patterns are called 
first-order components. The spatial separation of the first-order com- 
ponents from the zero-order component is fo\z, while the width of each 
component is proportional to Az/l. It follows that the wavelength-resolv- 
ing ability of the grating is proportional to the quantity fodz/(Az/l) = fol, 
or the number of sinusoidal fringes on the grating, but is independent of 
the observation distance z. Finally, it should be emphasized that our 
predictions have been based on the scalar formulation, and therefore are 
accurately obeyed only when the grating frequency is small compared 
with \~!. A more exact vectorial treatment, based on Maxwell’s equations, 
is required when this condition is violated. 
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Sinusoidal phase grating 


As a final example of Fraunhofer diffraction calculations, consider a 
sinusoidal phase grating defined by the transmittance function 


t(v1,y1) = exp Ee =a nf) rect (= i ? rect (“) (4-22) 


where by proper choice of phase reference we have dropped a factor 
representing the average phase delay through the grating. The parameter 
m represents the peak-to-peak excursion of the phase delay. 

If the screen is illuminated by a unit-amplitude, normally incident 
plane wave, then the field distribution immediately behind the screen is 
given precisely by Eq. (4-22). The analysis can be simplified by use of 
the identity 


Mm. = mr s 
exp E 3 sin Oxf) | - y Jq (=) exp (j2mgfor1) 


ae 


where J, is a Bessel function of the first kind, order g. Thus 
F{U(xi,y1)} = F{t(wi,yr)} 


(J? sine(/fx) sine(/fy)] * ez 12) 5(fx — Gfo, f)| 


l| 


oo 


» Pie é +) sincll( — qfo)] sine(Ufy) 


qg=-2 


and the field strength in the Fraunhofer diffraction pattern can be written 


Ie 
U(x0,Yo) = 


» |i5, raed else w’)| 


—¢ J (“) sinc EF (ioe afo2)| sinc (#) (4-23) 


If we again assume that fy > 2/1, there is negligible overlap of the various 
diffracted terms, and the corresponding intensity pattern becomes 


OND = lyo 
I (20,40) = (<) > Oh (5) sinc? E (to — afr) | sinc? (4 ) (4-24) 


The introduction of a sinusoidal phase grating has thus deflected 
energy out of the zero-order component into a multitude of higher-order 
components. The peak intensity of the gth-order component is simply 
[227 ,(m/2)/dz]*, while the displacement of that component from the center 
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Figure 4-8 Cross section of the Fraunhofer diffraction pattern of a 
sinusoidal phase grating (m = 8). 
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Figure 4-9 J,? (*) US. 5 ior three values of +4. 


of the diffraction pattern is gfo\z. Figure 4-8 shows a cross section of the 
intensity pattern when the peak-to-peak phase delay (m) is 8 radians. 
Figure 4-9 shows a plot of J,?(m/2) vs. m/2 for various values of g. Note 
that whenever m/2 is a root of Jo, the zero-order fringe vanishes entirely! 


4-3. EXAMPLE OF A FRESNEL DIFFRACTION PATTERN 


The calculation of a Fresnel diffraction pattern is in general a far more 
difficult problem than the corresponding Fraunhofer calculation. As an 
illustration of the case of Fresnel diffraction, we shall consider only a 
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square aperture; for additional Fresnel diffraction calculations see 
Probs. 4-3, 4-4, and 4-5. 

Suppose that a square aperture of width / is normally illuminated by 
a monochromatic plane wave of unit amplitude. Then the distribution of 
field immediately behind the aperture may be written 


U(x1,y1) = rect (= i ‘ rect (“) (4-25) 


The convolution form of the Fresnel diffraction equation is most con- 
veniently used, yielding the following expression for the diffracted field: 


k k 
Ueo,y.) = - : ies p {inl = 2) + (yi — ws] dy dys 


—1l/2 


This expression can be separated into the product of two integrals, 


U(co,yo) = APM) g(2,)(ys) (4-26) 
rz 
where I(x) = ie - exp is (ti — 20)" | dx, 


1/2 Vk 
I(Yo) = bao exp E - Gi = | dy1 


These integrals are substantially simplified by the change of variables 


eer) one 
yielding (xo) = Ae foe » (77 et) ae 
(yo) = a ie exp (3 af ‘) a 


where the limits of integration are 


Wee aE (1 Sa) 
| ines =¢ + 2) £2, = per oe 

k (1 k (1 
eG +0) Bae ees) (4-27) 


The integrals J(xo) and J(yo) can be evaluated in terms of tabulated 
functions known as Fresnel integrals, which are defined by 


wi 


I 


1 


a t? a , Tl? = 
C(a) =a cos “> dt S(a) = if sin > dt (4-28) 
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Noting that 


f 
ih exp (4 Fear = fr exp (43 Tet)ar— ff exp (s5e) ae 
we can write 


G(xo) = ee {[C(&2) — C(é&)] + g[S(é2) — S(é)]} 
ah (4-29) 


5(yo) = Re {[C(ns) — C(n)] + SCs) — Sn) ]} 


Finally, substitution of (4-29) in (4-26) yields a complex field distribution 


exp (jkz) 


U(xo,yo) = 2 {[C(é2) — C(é1)] + g[S(é2) — S(Ex)]} 
{[C(n2) — C(n1)] + SG) — S(m)]}_ (4-80) 


and a corresponding intensity distribution 


L(xo,yo) = 4t(C(é2) — C(E))? 
+ [S(E2) — S(é)]?} (CC) — CC)? + [SQm2) — SQ) ?} (4-31) 


Figure 4-10 Cornu’s spiral. 
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A convenient aid in interpreting these expressions is the graphical 
construction known as Cornu’s spiral, which is a simultaneous plot of 
C(a) and S(a) vs. the parameter a, as shown in Fig. 4-10. Note that the 
quantity C(a) + jS(«) can be interpreted as the complex phasor joining 
the origin to the point a on the spiral. It follows that the quantity 
{(C(é2) — C(é1)] + g[S(é2) — S(é1)]} is the phasor defined by a line 
running from point & to point £ on the spiral. Using graphical techniques 
such as this, the expressions (4-30) and (4-31) can be evaluated at each 
point in the diffraction pattern. 

To illustrate this type of calculation, consider the field distribution 
U(xo,yo) at a very small distance z from the aperture (i.e., deep within the 
region of Fresnel diffraction, but still far enough from the aperture for 
the Fresnel approximation to remain valid). For sufficiently small z, 
k/rz is an exceedingly large number, and the following approximations 
to Eqs. (4-27) are accurate: 


— 0 7 a amet ca 05 
to m<—5 $2 <5 
l l 
== 8 Uke a5 mace Yor s 
m= n= 
l l 
+a UO ais + 20 yo< 5 


Referring to Cornu’s spiral, the corresponding values of C(a) and S(a) 
are found to be 


Poeeeeee 
Ci) = Sq) =} 7 . 
ae Lig Pals 
lee Be aoe 
CEy=sey=\ 4 : 
ee 
eee 
CAMEO R= | 
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C(n2) = S(n2) = 
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2 
Substituting these results in Eq. (4-30), we find 


U(20,yo) = exp (kz) rect () rect (*) (4-32) 


Thus deep within the region of Fresnel diffraction, the field distribution 
U(2o,yo) is the geometrical projection of the aperture distribution U(x1,y1), 
and the predictions of geometrical optics are therefore accurate. This 
result is, of course, not peculiar to the rectangular aperture but can be 
shown to hold for arbitrary apertures with the help of the principle of 
stationary phase [Ref. 3-4, app. III]. 


PROBLEMS 


4-1 Assuming unit-amplitude, normally incident plane-wave illumination: 
(a) Find the intensity distribution in the Fraunhofer diffraction pattern of the 
double-slit aperture shown in Fig. P4-1. 
(b) Sketch cross sections of this pattern that appear along the zo- and yo- axes 
in the observation plane, assuming X/Az = 10 m~!, Y/xXz = 1 m™, and A/Az = 
39m ~!, z being the observation distance and \ the wavelength. 


Yi 


Figure P4-1 
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4-2 (a) Sketch the aperture described by the transmittance function 


Uri,y1) = {| ree (2) rect @) * \3 comb (“) xr» | rect (%) 


where JN is an odd integer and A > Y. 

(b) Find an expression for the intensity distribution in the Fraunhofer diffrac- 
tion pattern of that aperture, assuming illumination by a normally incident 
plane wave and N > 1. 

(c) What relationship between Y and A can be expected to minimize the strength 
of the even-order diffraction components while leaving the zero-order component 
approximately unchanged? 


4-3 An aperture © in an opaque screen is illuminated by a spherical wave converging 
toward a point P located in a parallel plane a distance z behind the aperture, as 
shown in Fig. P4-3. 

(a) Find a quadratic approximation to the illuminating wavefront in the plane 
of the aperture, assuming that (1) P lies on the z axis, and (2) P lies at coordinates 
(0, Yo) . 

(6) Assuming Fresnel diffraction from the plane of the aperture to the observa- 
tion plane containing P, show that in both of the above cases the observed 
intensity distribution is the Fraunhofer diffraction pattern of the aperture, 
centered on the point P. 


Figure P4-3 


4-4 Find the intensity distribution on the aperture axis in the Fresnel diffraction 
patterns of apertures with the following transmittance functions (assume 
normally incident, unit-amplitude plane-wave illumination): 


(a) Uai,y1) = cireW/ x1? + yi? 
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4-5 


eee a<Varty? <i 
(6) Urry) = 0 otherwise 


where a < 1. 


Two discrete spectral lines of a luminous source are said to be “just resolved”’ 
by a diffraction grating if the peak of the gth-order diffraction component due 
to the source wavelength A, falls exactly on the first zero of the gth-order diffrac- 
tion component due to the source wavelength A». The resolving power of the 
grating is defined as the ratio of the mean wavelength \ to the minimum resoly- 
able wavelength difference AX. Show that the resolving power of the sinusoidal 
phase grating discussed in this chapter is 

r 

Roma ee 
where q is the diffraction order used in the measurement and M is the number 
of spatial periods of modulation ruled on the grating. What phenomena limit 
the use of arbitrarily high diffraction orders? 


Assuming normally incident plane-wave illumination, and neglecting finite 
aperture extent: 
(a) Find the Fresnel diffraction pattern of a screen with the following trans- 
mittance function: 

U(az,y) = 19(1 + m cos 2rfor) 


(b) Given that m <1, at what distances z from the aperture is the field distri- 
bution across a parallel plane (1) purely ampliiude modulated over space? (2) 
approximately phase modulated over space? 


Find an expression for the intensity distribution in the Fraunhofer diffraction 
pattern of the aperture shown in Fig. P4-7. Assume unit-amplitude, normally 
incident plane-wave illumination. 


<—~ > 
tS 


Figure P4-7 


5 FOURIER TRANSFORMING AND 
IMAGING PROPERTIES OF LENSES 


The most important components of optical imaging and optical data- 
processing systems are, of course, lenses. While a thorough discussion of 
the properties of lenses would be helpful, such a treatment would require 
a rather lengthy detour through the basic principles of geometrical optics. 
Since a complete discussion is not essential for our purposes, we shall adopt 
an alternative point of view which, while not resting directly on the 
principles of geometrical optics, is nonetheless entirely consistent with 
them. Our discussions will be limited to the case of monochromatic 
illumination, with generalization to the nonmonochromatic case being 
deferred to Chap. 6. 


5-1 A THIN LENS AS A PHASE TRANSFORMATION 


A lens is composed of optically dense material, usually glass, in which the 
propagation velocity of an optical disturbance is less than the velocity 
in air. A lens is said to be a thin lens if a ray entering at coordinates (x,y) 
on one face emerges at approximately the same coordinates on the opposite 
face, i.e., if there is negligible translation of the ray within the lens. Thus 
a thin lens simply delays an incident wavefront by an amount proportional 
to the thickness of the lens at each point. 

Referring to Fig. 5-1, let the maximum thickness of the lens be Ao, 
and let the thickness at coordinates (x,y) be A(z,y). Then the total phase 
delay suffered by the wave at coordinates (z,y) in passing through the 
lens may be written 


where n is the index of refraction of the lens material, knA(z,y) is the 
phase delay introduced by the lens, and k[A) — A(z,y)] is the phase delay 
introduced by the remaining region of free space between the two planes. 
Equivalently, the lens may be represented by a multiplicative phase 
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Figure 6-1 The thickness 
function. 


transformation of the form 
ti(x,y) = exp [jkA] exp [jk(n — 1)A(z,y)] (5-1) 


The complex field U;(z,y) across a plane immediately behind the lens is 
then related to the complex field U,(z,y) incident on a plane immediately 
in front of the lens by 


Ui(x,y) = ti(z,y)UL(z,y) (5-2) 


The problem remains to specify the mathematical form of the thickness 
function A(x,y) in order that the effects of a lens may be understood. 


The thickness function 


In order to specify the forms of the phase transformations introduced by 
a variety of different types of lenses, we first adopt a sign convention: 
as rays travel from left to right, each convex surface encountered is taken 
to have a positive radius of curvature, while each concave surface is taken 
to have a negative radius of curvature. Thus in Fig. 5-1 the radius of 
curvature of the left-hand surface of the lens is a positive number R,, 
while the radius of curvature of the right-hand surface is a negative 
number Re. 

To find the thickness A(z,y), we split the lens into two parts, as 
shown in Fig. 5-2, and write the total thickness function as the sum of 
the two individual thickness functions, 


A(a,y) = Ar(x,y) + A2(x,y) (5-3) 


Referring to the geometry shown in that figure, the thickness Aj(z,y) is 
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seen to be given by 


Ai(z,y) = do — (Ri — VR? — 2? — y?) 
=a = Pi(1 =f —2 7) (5-4) 
R;? 
Similarly, Ao(x,y) = Ace — (— Re — JSR? — x? — y?) 


(5-5) 
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where we have factored the positive number — R2 out of the square root. 
Combining (5-3), (5-4), and (5-5), the total thickness function is seen to be 


giao ety 
A(zx,y) = 4) — RB (1 - ee a) 


1 


where Ao = Ao1 + Aoo. 


Figure 5-2 Calculation of the thickness function. 
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The paraxial approximation 


The expression for the thickness function can be substantially simplified 
if attention is restricted to portions of the wavefront that lie near the 
lens axis, or equivalently, if only paraxial rays are considered. Thus we 
consider only values of x and y sufficiently small to allow the following 
approximations to be accurate: 


4/1 Dt eter e+ y 
Ree 2Ry 


(5-7) 
1 a? —- af? es 1 x? a y? 
\ icatlipide Rates 2R2? 


2 


The resulting phase transformation will, of course, represent the lens 
accurately over only a limited area, but this limitation is no more restric- 
tive than the usual paraxial approximation of geometrical optics. Note 
that the relations (5-7) amount to approximations of the spherical surfaces 
of the lens by parabolic surfaces. With the help of these approximations, 
the thickness function becomes 


A(z,y) = Ao — oc (= - x) (5-8) 


The phase transformation and its physical meaning 


Substitution of Eq. (5-8) into Eq. (5-1) yields the following paraxial 
approximation to the lens transformation: 


é , rae ea py fo 1 
t(x,y) = exp [jknA,] exp | aiken — 1) 9 = (; — x) 


The physical properties of the lens (that is, n, R1, and R2) can be com- 
bined in a single number f called the focal length, which is defined by 


i 1 1 
jae-0(E 3) ie 


Thus the phase transformation may be rewritten 


ti(z,y) = exp [jknAy] exp E é Coes | (5-10) 


This equation will serve as our basic representation of the effects of a 
lens on an incident disturbance. 

Note that while our derivation of this expression has assumed the 
specific lens shape shown in Fig. 5-1, the sign convention adopted allows 
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Double-convex Plano-convex Positive 
meniscus 

Double-concave Plano-concave Negative 
meniscus 


Figure 5-3 Vartous types of lenses. 


the result to be applied to other types of lenses. Figure 5-3 illustrates 
several different types of lenses with various combinations of convex and 
concave surfaces. In Prob. 5-1, the reader is asked to verify that the sign 
convention adopted here implies that the focal length f of a double- 
convex, plano-convex, or positive meniscus lens is positive, while that of a 
double-concave, plano-concave, or negative meniscus lens is negative. 
Thus Eq. (5-10) can be used to represent any of the above lenses, provided 
the correct sign of the focal length is used. 

The physical meaning of the lens transformation can best be under- 
stood by considering the effect of the lens on a normally incident, unit- 
amplitude plane wave. Thus the field distribution U, in front of the lens 
is unity, and Eq. (5-2) yields the following expression for U) behind the 


lens: 


k 
Ui (x,y) = exp [jknAo] exp E of (a2 f v") | 


The first term is simply a constant phase delay, while the second term we 
may interpret as a quadratic approximation to a spherical wave. If the 
focal length f is positive, then the spherical wave is converging toward 
a point on the lens axis a distance f behind the lens. If f is negative, 
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Figure 5-4 Effect of a converging lens and a 
diverging lens on a normally incident 
plane wave. 


then the spherical wave is diverging about a point on the lens axis a 
distance f in front of the lens. These two cases are illustrated in Fig. 5-4. 
Thus a lens with a positive focal length is called a converging lens (or a 
positive lens), while a lens with a negative focal length is a diverging lens 
(or a negative lens). 

Our conclusion that a lens composed of spherical surfaces maps an 
incident plane wave into a spherical wave is very much dependent on the 
paraxial approximation. Under nonparaxial conditions, the emerging 
wavefront will exhibit departures from perfect sphericity (called aberra- 
tions—see Sec. 6-4), even if the surfaces of the lens are perfectly spherical. 
In fact, lenses are often ‘‘corrected”’ for aberrations by grinding their 
surfaces aspherical in order to improve the sphericity of the emerging 
wavefront. 

We should emphasize, however, that the results which will be derived 
using the multiplicative phase transformation (5-10) are actually more 
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general than the analysis leading up to that equation might imply. A 
thorough geometrical-optics analysis of most well-corrected lens systems 
shows that they behave in essentially the same way predicted by our 
more restricted theory. 


5-2 FOURIER TRANSFORMING PROPERTIES OF LENSES 


One of the most remarkable and useful properties of a converging lens 
is its inherent ability to perform two-dimensional Fourier transformations. 
The Fourier transforming operation is one with which we generally 
associate bulky, complex, and expensive electronic spectrum analyzers, 
yet this complicated analog operation can be performed with extreme 
simplicity in a coherent optical system. 

In the material that follows, three separate configurations for 
performing the transform operation are considered. In all cases the 
illumination is assumed monochromatic and the distribution of light 
amplitude across the back focal plane! of the lens is of concern. Figure 
5-5a—c illustrates the three configurations considered. In case (a) the 
object to be transformed is placed directly against the lens itself. In 
case (b) the object is placed a distance dp in front of the lens. In case (c) 
the object is placed behind the lens at a distance d from the focal ptane. 

For supplementary discussions of the Fourier transforming properties 
of lenses, the reader may consult Ref. 5-1, 5-2, or 5-3. 


Object placed against the lens 


Let a plane object with amplitude transmittance t.(z,y) be placed immed- 
iately in front of a converging lens of focal length f, as shown in Fig. 
5-5a. The object is assumed to be uniformly illuminated by a normally 
incident, monochromatic plane wave of amplitude A, in which case the 
disturbance incident on the lens is 


U,(z,y) = At.(z,y) (5-11) 


The finite extent of the lens aperture can be accounted for by associating 
with the lens a pupil function P(x,y) defined by 


Pome 1 inside the lens aperture 
(ty) = 0 otherwise 


Thus the amplitude distribution behind the lens becomes, using (5-10), 
k 
Uien) =UewPeven [Tz +u)] G12 


1 The front and back focal planes of a lens are defined as planes normal to the 
lens axis, situated at distance |f| in front of and in back of the lens, respectively. 
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Figure 5-5 Fourter transforming configurations. 
(a) Object placed against the lens; 
(b) object placed in front of the lens; 
(c) object placed behind the lens. 


The constant phase delay associated with the lens transformation has 
been omitted since it does not affect the result in any significant 
way. 

To find the distribution U;(x;,y;) of field amplitude across the back 
focal plane of the lens, the Fresnel diffraction formula (4-10). is applied. 
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Thus, putting 2 =f, 


_k 
exp E of (ay? + w| » 


a ae 


U;(xy,ys) — U;(z,y) 


evel ae . 2 

exp | J SE + y*) | exp | —j — (wx, + yy,) | dx dy (5-13) 
of Mf 

where a constant phase factor has been dropped. Substituting (5-12) in 


(5-13), the quadratic phase factors within the integrand are seen to 
cancel, leaving 


exp EE Y, Carag w| 


IM 


[f U,(z,y)P(z,y) exp |= “ae v (aa; + w| dxdy (5-14) 


U;(zy,ys) a 


Thus the field distribution U; is proportional to the two-dimensional 
Fourier transform of that portion of the incident field subtended by the 
lens aperture. When the physical extent of the object is smaller than the 
lens aperture, the factor P(xz,y) may be neglected, yielding 


A exp lis of Cees w| 


IM 


if to(x,y) exp [137 (wes + w | dx dy (5-15) 


U; (xz,Ys) = 


Evidently the amplitude and phase of the light at coordinates (a;,y,) 
are influenced by the amplitude and phase of the object Fourier com- 
ponent at frequencies (fx = x;/Af, fy = y,/Af). 

Note that the Fourier transform relation between the object and the 
focal-plane amplitude distribution is not an exact one, due to the presence 
of the quadratic phase factor that precedes the integral. While the phase 
distribution across the focal plane is not the same as the phase distri- 
bution across the object spectrum, the difference between the two is a 
simple phase curvature. In most cases it is the intensity across the focal 
plane that is of real interest. Measurement of the intensity distribution 
yields knowledge of the power spectrum of the object; the phase distri- 
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bution is of no consequence in such a measurement. Thus 


if ji t.(x,y) exp |-1= (way + w | dx dy | (5-16) 


A?2 
I ,(xz,ys) a rf? 


Object placed in front of the lens 


Consider next the more general geometry of Fig. 5-5b. The object, 
located a distance d, in front of the lens, is illuminated by a normally 
incident plane wave of amplitude A. The amplitude transmittance of the 
object is again represented by t.. In addition, let F.(fx,fy) represent 
the Fourier spectrum of the light transmitted by the object, and F,(fx,fr) 
the Fourier spectrum of the light incident on the lens; that is, 


Fo(fx,fy) = F{Ato} Fi(fx,fy) = F{U)} 


Assuming that the Fresnel approximations are valid for propagation over 
the distance d,, then F, and F, can be related by means of Eq. (4-11), 
giving 

Fi(fx,fy) = Fo(fx,fy) exp [—jrAd.(fx? + fr?)] (5-17) 
where we have dropped a constant phase delay. 


For the moment, the finite extent of the lens aperture will be neg- 
lected. Thus, letting P = 1, Eq. (5-14) can be rewritten 


exp liz oF (zy? + w| Le 
U;(2y,yy) = AY F, (% 4) (5-18) 


Substituting (5-17) in (5-18), we have 


wf (-—$)oe +09 
p 2f f vy 5") : @ ut) 


iM Mf A 


& ae 
A exp E of ( -- Yay _ w| 


IM 


I i (25. Ua) exp |-i3 (Lory + vs) | an. dYyo (5-19) 


U;(x;,ys) = = 


or U,;(x;,ys) = 


Thus the amplitude and phase of the light at coordinates (x;7,Yy) 
are again related to the amplitude and phase of the object spectrum at 
frequencies (x;/Af, y;/Af). Note that a phase factor again precedes the 
transform integral, but that it vanishes for the very special case d, = iF 
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Figure 5-6 Vignetting of the object. 


Evidently when the object is placed in the front focal plane of the lens, the 
phase curvature disappears, leaving an exact Fourier transform relation! 

To this point we have entirely neglected the finite extent of the lens 
aperture. To include the effects of this aperture, we use a geometrical- 
optics approximation. Such an approximation is an accurate one if the 
distance d, is sufficiently small to place the object deep within the region 
of Fresnel diffraction for the lens aperture. This condition is satisfied in 
the vast majority of problems of interest. With reference to Fig. 5-6, the 
light amplitude at coordinates (z;,y;) is a summation of all the rays 
traveling with direction cosines (a = x,/f, 8 = y;/f). However, only a 
finite portion of these rays is collected by the lens aperture. Thus the 
finite extent of the lens aperture may be accounted for by projecting that 
aperture back into the object plane, the projection being centered on a 
line joining the coordinates (z;,y,;) with the center of the lens (see Fig. 
5-6). Again the lens aperture limits the effective extent of the object, but 
this time the particular portion of t, that affects U, depends on the coor- 
dinates (x;,y;). As implied by Fig. 5-6, the value of U, at (x;,y;) can be 
found from the Fourier transform of that portion of the object subtended 
by the pupil function P, centered at coordinates [7 = —(d./f)zys, 


Yo = —(d./f)ys|. Thus 


"i d, 
A exp Ee (1 = ?) (iy? oe w)| 


U;(xs,ys) = BN 
i d. d. 2a 
ee a ae aes ; dx, dYo 
IJ to(%0,Yo)P (2. oa j £7, Yet F ) exp j yf (ees oy w)| Lo dy 


(5-20) 
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The limitation of the effective object by the finite lens aperture is 
known as a vignetting effect. Note that vignetting in the object space is 
minimized when the object is placed close to the lens and when the lens 
aperture is much larger than the object. In practice it is often preferred to 
place the object directly against the lens in order to minimize vignetting, 
although in analysis it is generally convenient to place the object in the 
front focal plane, where the Fourier transform relation is exact. 


Object placed behind the lens 


Finally we consider the case of an object that is placed behind the lens, 
as shown in Fig. 5-5c. The object again has transmittance t., but is now 
located a distance d from the focal plane of the lens. Let the lens be 
illuminated by a normally incident plane wave of amplitude A. Then 
incident on the object is a spherical wave converging toward the back 
focal point of the lens. 

In the geometrical-optics approximation, the amplitude of the spher- 
ical wave impinging on the object is Af/d; the particular region of the 
object that is illuminated is determined by the intersection of the con- 
verging cone of rays with the object plane. If the lens is circular of 
diameter J, then a circular region of diameter /d/f is illuminated in the 
object space. The finite extent of this illuminating spot can be represented 
mathematically by projecting the pupil function of the lens down the 
cone of rays onto the object, yielding an effective pupil function 
Plxo(f/d), yo(f/d)] in the object plane. Note that the object transmittance 
t. will have a finite aperture associated with it; the effective aperture in 
the object space is thus determined by the intersection of the true object 
aperture with the projected pupil function. If the object aperture is 
fully illuminated, then the projected pupil function may be ignored. 

Using a paraxial approximation to the spherical wave that illuminates 
the object, the field amplitude transmitted by the object may be written 


A k 
Uy Ua) — [oP (2. : Yo ‘) exp | =i Dd Ce + || to (Lo, Yo) (5-21) 


Assuming Fresnel diffraction from the object plane to the focal plane, 
Eq. (4-10) can be used to show 


Skoog 
A exp E Da (he ate w)| f 
jad d 


U;(x;,ys) = 


[f seous? (2 Sra) ex [3% 
JJ o(o,Yo)P \ Xo 7 Yo = ) EXP TI Sg ors + yous) dtp dyo (5-22) 
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Figure 5-7 Optically obtained Fourier spectrum of the character “3.” 


Thus, up to a quadratic phase factor, the focal-plane amplitude distri- 
bution is the Fourier transform of that portion of the object subtended 
by the projected lens aperture. 

The result represented by Eq. (5-22) is essentially the same result 
obtained previously when the object was placed directly against the lens 
itself. However, an extra flexibility has been obtained in the present 
configuration; namely, the scale of the Fourier transform is under the 
control of the experimenter. By increasing d, the spatial size of the trans- 
form is increased, at least until the transparency is directly against the 
lens (that is, d = f). By decreasing d, the size of the transform is made 
smaller. This flexibility can be of considerable utility in spatial filtering 
applications (see Chap. 7), where some potential adjustment of the scale 
of the transform is often of considerable help. 

Finally, we illustrate with a typical example the type of two- 
dimensional Fourier analysis which can be achieved optically with great 
ease. Figure 5-7 shows on the left a transparent character ‘3’? which is 
placed in front of a positive lens and coherently illuminated, yielding in 
the back focal plane a spectrum of the character, as shown on the right. 
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Figure 5-8 Geometry for image formation. 


Note in particular the high-frequency components introduced by the 
straight portions of the character. 


5-3 IMAGE FORMATION: MONOCHROMATIC ILLUMINATION 


Certainly the most familiar property of lenses is their ability to form 
images. If an object is placed in front of a lens and illuminated, then under 
appropriate conditions there will appear across a second plane a distri- 
bution of light intensity that closely resembles the object. This distribu- 
tion of intensity is called an image of the object. The image may be real 
in the sense that an actual distribution of intensity appears across a 
plane behind the lens, or it may be virtual in the sense that the light 
behind the lens appears to originate from an intensity distribution across 
a new plane in front of the lens. 

For the present, we consider image formation in only a limited con- 
text. First, we restrict attention to a positive, aberration-free thin lens 
that forms a real image. Second, we consider only monochromatic illumina- 
tion, a restriction implying that the imaging system is linear in complex 
field amplitude. Both these restrictions will be removed in Chap. 6, 
where the problem of image formation will be treated in a much more 
general fashion. 


The impulse response of a positive lens 


Referring to the geometry of Fig. 5-8, suppose that a planar object is 
placed a distance d, in front of a positive lens and is illuminated by 
monochromatic light. We represent the complex field immediately 
behind the object by U,.(x,y.). At a distance d; behind the lens there 
appears a field distribution that we represent by U,(z;,y;). Our purpose is 
to find the conditions under which the field distribution U; can reasonably 
be said to be an “image” of the object distribution U,. 
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In view of the linearity of the wave-propagation phenomenon, we 
can in all cases express the field U; by the following superposition integral: 


U:(ra,ys) = if A(2:,Ys3Lo, Yo) Vo(Xo,Yo) dx, dYo (5-23) 


where h(2;,Y:;%,Yo) is the field amplitude produced at coordinates (Xi, Ys) 
by a unit-amplitude point source applied at object coordinates (a,,yo). 
Thus the properties of the imaging system will be completely described 
if the impulse response h can be specified. 

If the optical system is to produce high-quality images, then U; 
must be as similar as possible to U,. Equivalently, the impulse response 
should closely approximate 


h(z:,Yij%o,Yo) = Ko(a; + Mao, y; + My.) (5-24) 


where K is a complex constant, M represents the system magnification, 
and the plus or minus signs are included to allow for possible image 
inversion. We therefore shall specify as the “image plane” that plane 
where (5-24) is most closely approximated. 

To find h, let the object be a 6 function (point source) at coordinates 
(xo,yo). Then incident on the lens will appear a spherical wave, diverging 
about the point (z.,y.). The paraxial approximation to that wave is 
written 


Ui(z,y) = : [(Gea,)* iy = we) (5-25) 


exp lj Dd, 


Jrdo 


After passage through the lens, the field distribution becomes 
1 fk 2 2 iy) 
Ur(x,y) = Ur(z,y)P(@,y) exp | —7 of (x? + y”) (5-26) 


Finally, using the Fresnel diffraction equation (4-9) to account for 
propagation over the distance d;, we have 


h(@i,YijLo,Yo) = — + II U; (2, y) 


exp lee [@i — 2)? + (ye — y)?]pdedy (5 27) 
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where constant phase factors have been dropped. Combining (5-25), 
(5-26), and (5-27) yields the formidable result 


sac. [7a 0 + | om [gg 2 + 9) | 
ic P(x,y) exp lis : (+ “3 ; = see + | 


A Xo Xi Yo Yi 
a9 oh tad be i a ee 5-28 
exp | gk (= _ =) _ (® ~ s) v|| dxdy ( ) 


Equations (5-23) and (5-28) now provide a formal solution specifying 
the relationship that exists between U, and U;. However, it is difficult 
to determine the conditions under which U; can reasonably be called an 
image of U, unless further simplifications are adopted. 


A(2i,Yi;LojYo) = 


Eliminating the quadratic phase factors: the lens law 


The most troublesome terms of the impulse response above are those 
containing quadratic phase factors. Note that two of these terms are 
independent of the lens coordinates (x,y), namely, 


exp Ee — (a? + ve) | and exp Ea 24 ue) | 


These terms are simply indicative of phase curvatures over the xy; 
and x.y. planes. If we were willing to consider image formation between 
two spherical surfaces, rather than between two planes, they could be 
directly dropped. However, even for the case of image formation between 
two planes, these two terms can be sae to be immaterial. 


To eliminate the factor exp Ee 2 ¥; »|, note that in the vast 


majority of cases of interest, the fee ee of light behind the lens will 
be the end product of the imaging operation. This distribution will 
therefore be directly sensed by a detector that responds only to light 
intensity (e.g., photographic film). Since the term in question modifies 
only the phase distribution of the light, it will in no way affect the results 
of the measurement and therefore can be dropped. 


k 
Unfortunately, the phase factor exp ix (to? + we’) | cannot be 


disposed of so directly, for it depends on the variables of integration 
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Figure 5-9 The region R where h has significant value for the 
particular coordinates (x;,y;) shown. 


(Xo,yo) of the superposition integral.! However, it can be eliminated in 
most cases of interest by the following argument. If the imaging system 
is to behave in a fashion approximating the ideal behavior of Eq. (5-24), 
the light amplitude at coordinates (zx,,y;) must consist of contributions 
from only a tiny region of the object space, centered on the ideal geo- 
metrical object point (see Fig. 5-9). If within that tiny region the argu- 


k : 
ment of exp E Dd. (xo? + w’)| changes by no more than a fraction of a 


radian, then we may use the approximation 


exp lise (to? + wd) | = ~ exp E a (ee) (5-29) 


Since the (z.,y.) dependence has been removed, the exponential may now 
be dropped by again noting that it will not affect the intensity measure- 
ment in the z,y; plane. For more consideration of this approximation 
and the conditions for its validity, see Prob. 5-7. 

By means of the above arguments, we have simplified the expression 
for the impulse response to 


(xi,Yi;To,Yo) = = Z if P(x,y) exp E AG +5 rh i) (2? + | 


mol-alleed)-r(e jaw om 


As a final simplification, we restrict attention to a particular plane 


1 When the illumination is “incoherent,” the imaging system can be shown to be 
linear in intensity, with impulse response |h|*. In such a case, both phase factors can 
be directly dropped. For more details see Chap. 6. 
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behind the lens, namely that for which the distance d; satisfies 
apis oe re) (5-31) 


This relation is familiar from elementary geometrical optics, where it is 
known as the lens law. It specifies the particular distance behind the lens 
where the rays emanating from a single object point will again cross in 
an image point. From this geometrical-optics point of view, the lens law 
must be satisfied if an impulse response resembling the ideal is to be 
achieved. Assuming, then, that the lens law is satisfied, the impulse 
response reduces to 


hziysitoye) & aa 7 if P(x,y) 


. 0 Li Yo Yi 
— — + — = dx d 5-32 
exp | t[(+2)2+ (4+) ,I| xdy (5-32) 
Defining the magnification of the system by 
== (5-33) 


we find the final simplified form of the impulse response 


A(xi, YijCo,Yo) = I P(x,y) 


= d; 


exp is vi [a + Ma)x + (yi + M woul} dxdy (5-34) 

Thus if the lens law is satisfied, the impulse response is seen to be 
given by the Fraunhofer diffraction pattern of the lens aperture, centered 
on image coordinates (x; = —Mz,, y; = —My.). The occurrence of a 
Fraunhofer diffraction formula should not be entirely surprising. By 
choosing d; to satisfy the leus law, we are examining the plane toward 
which the spherical wave leaving the lens is converging. From the results 
of Prob. 4-3, we should expect the distribution of light about the point of 
convergence to be precisely the Fraunhofer diffraction pattern of the 
lens aperture that limits the extent of that spherical wave. 


The relation between object and image 


Consider first the nature of the image predicted by geometrical optics. 
To find this ideal image, we shall use the common artifice of allowing the 
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wavelength ) to approach zero, in which case diffraction effects become 
negligible. With the change of variables 


Oe Fa a, 
i = = 
Vieuw = vi 


(5-35) 


the impulse response (5-34) can be rewritten 
h(xi,y;to,yo) = M ff Padtrag) 


exp {—j2n[(zi + Ma.)# + (yi + My.)§]} dé dg (5-36) 


As ) approaches zero, the range of (#,7) over which P equals unity will 
grow without bound, allowing P to be replaced by unity while retaining 
the infinite limits of integration. Thus 


h(xisysste.yo) 2 M ff exp {—J2nl(xi + Max)e + (ys + My.)gl} az ag 
= M 6(2: = Mx, Yi = My.) 

7; (= + a, = 4 

a 7 eg v Wo (5-37) 


Using this result in the superposition integral (5-23), the object and image 
amplitude distributions are seen to be related by 


if Li i 
Ui(zi,yi) = i U, (- pe #) (5-38) 


We conclude, then, that the image predicted by geometrical optics is an 
exact replica of the object, magnified and inverted in the image plane. 

The predictions of geometrical optics are, of course, not exact. A 
more complete understanding of the relation between object and image 
will be obtained only when diffraction effects are included. To this end, 
we return to the expression (5-36) for the impulse response, and make the 
following additional change of variables: 


%, = —Mx, Gj. = —My, (5-39) 


in which case we find the impulse response to be 
h (xi, Yi 3£o,Go) = M a P(\d:€,\d,7/) 


exp { —j2a[(zi — £0) + (yi — Go)G]} dé dy (5-40) 


Note that h is now space-invariant, depending only on the coordinate 


96 / INTRODUCTION TO FOURIER OPTICS 


differences (a; — &, yi — 9). With the final definition 


Pe iL 
a 


= —h 5-41 
M ( ) 


the superposition integral (5-23) becomes 


Lo Yo 


o . E 1 i ; , 
Us(2,y:) = i. ii hte — ote = Yo) E U. (- a m) dé, dij. (5-42) 


which we recognize as a convolution of the impulse response h with the 
image predicted by geometrical optics. For convenience we define a new 
function U, which represents the geometrical-optics prediction; that is, 


1 i i 
U,(2:,y:) = M U, (- = Se u) (5-43) 


The convolution (5-42) can then be written in simplified notation as 


Ui(xi,yi) = h(xi,ys) * U,(2:,yi) (5-44) 
where 


(ey) = ff POdt,rdg) exp (—j2x(xt + yg) dz dg (5-45) 


Equations (5-44) and (5-45) represent the final results of our present 
analysis. They indicate that when diffraction effects are included, the 
image is no longer a perfect replica of the object. Rather, the actual image 
obtained is a smoothed version of the object, a consequence of the nonzero 
width of the impulse response h. This smoothing operation can strongly 
attenuate the fine detail of the object, with a corresponding loss of image 
fidelity resulting. An exactly analogous phenomenon can be observed 
when an electrical signal is applied to a linear electrical network. If the 
duration of the impulse response of the network is long compared with 
the “fluctuation time” of the input signal, then the network will smooth 
the input. Thus rapid variations of the input will not be reproduced at 
the output. 

In the case of time-invariant electrical networks, the loss of signal 
fidelity is often most conveniently described in the frequency domain. 
If the input signal contains frequency components which are too high 
to be passed by the transfer function of the network, then the signal will 
not be accurately reproduced at the output. The great utility of frequency- 
analysis concepts in the electrical case suggests that similar concepts might 
be usefully employed in the study of optical imaging systems. The 
application of filtering concepts to imaging systems is, in fact, a subject 
of great importance and will be the primary topic of the chapter to follow. 
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PROBLEMS 


5-1 Show that the focal lengths of double-convex, plano-convex, and positive 
meniscus lenses are always positive, while the focal lengths of double-concave, 
plano-concave, and negative meniscus lenses are always negative. 


5-2. (a) Find a paraxial approximation to the phase transformation introduced by a 
lens that is the portion of a cylinder illustrated in Fig. P5-2. 
(b) What is the effect of such a lens on a plane wave traveling down the lens 
axis? 


ee 


Figure P5-2 


5-3 Consider a lens that consists of the portion of a cone illustrated in Fig. P5-3. 
(a) Show that a paraxial approximation to the phase transformation introduced 
by such a lens is (under the thin-lens assumption) 


(n —1)R x? 
h(a,y) = exp {i | na == h io “all 


RA — y/h 
where fly) = Sete HE. 
n—l 


(b) What is the effect of such a lens on a plane wave traveling normal to the 
zy plane? 


Figure P5-3 
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5-4 


5-5 


5-6 


5-7 


An object function U,, bounded by a circular aperture of diameter J, is impressed 
across the front focal plane of a circular converging lens of diameter L. The 
intensity distribution is measured across the back focal plane of the lens. 
Assuming L > 1: 

(a) Find an expression for the maximum spatial frequency for which the 
measured intensity accurately represents the squared modulus of the object’s 
Fourier spectrum. 

(b) What is the numerical value of that frequency (in cycles/mm) when 
L =4cm, 1 = 2 cm, f (focal length) = 50 cm, and \ = 6 X 1077 meter? 

(c) Above what frequency does the measured spectrum vanish, in spite of 
the fact that the object may have nonzero Fourier components at higher 
frequencies? 


A diffracting screen has a circularly symmetric amplitude transmittance func- 


tion given by 
Ar) 1 1 ENV ae r 
={—-+-—cos irc { - 
r gt 5 cos art } ere |= 


(a) In what ways does this screen act like a lens? 

(b) Give an expression for the focal length of the screen. 

(c) What characteristics might seriously limit the use of this screen as an 
imaging device, particularly for polychromatic objects? 


An array of one-dimensional object functions can be represented by U.(z,y«), 
where yi, Y2, --+ 5 Yk ---+, yn are WN fixed y coordinates. It is desired to 
perform a Fourier transformation of all N functions in the z direction, yielding 
an array of transforms 


Gfx) = [7 Ulz.me) exp (—72xfx2) der 


Neglecting the finite extent of the lens and object apertures, use the Fourier 
transforming and imaging properties of lenses derived in this chapter to show 
how this can be done with: 


(a) two cylindrical lenses of different focal lengths 
(6) a cylindrical and a spherical lens of the same focal length 


SIMPLIFICATION: You need only display |G,|?, so phase factors may be dropped. 


With reference to the approximation of Eq. (5-29): 


(a) At what radius r in the object plane has the factor exp E Od, (xo? + | 
“Yo 

changed by exactly 7 radians from its value at the origin? 

(6) Assuming a circular pupil function of radius a, what is the radius (in the 

object plane) to the first zero of h, assuming that the observation point in the 

image space is the origin? 


(c) From these results, what relation between a, \, and d, will allow the phase 


wilt 
factor exp E 2d, (Xo? + | to be dropped, assuming observation near the lens 


axis? 
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5-8 A normally incident, unit-amplitude, monochromatic plane wave illuminates 
a converging lens of 5 cm diameter and 2 meters focal length. One meter 
behind the lens and centered on the lens axis is placed an object with amplitude 
transmittance 


t(%o,Yo) = 44(1 + cos 2rfox.) rect (#) rect (#) 


Assuming L = 1 cm and f, = 100 cycles/em, sketch the intensity distribution 
across the z; axis of the focal plane, labeling the numerical values of the distance 
between the diffracted components and the width (between first zeros) of the 
individual components. 


Object 


ce 
pointe 


1m >| 


Figure P5-8 


5-9 A unit-amplitude, normally incident, monochromatic plane wave illuminates 
an object of maximum linear dimension /, situated immediately in front of a 
larger converging lens of focal length f. Due to a positioning error, the intensity 
distribution is measured across a plane at a distance f — A behind the lens. 
How small must A be if the measured intensity distribution is to accurately 
represent the Fraunhofer diffraction pattern of the object? 


fe ash 


l 


= 


i 


Figure P5-9 


5-10 A certain diffracting screen with an amplitude transmittance 


1 1 : r 
ur) = E ar ; sgn(cos ar) | cire (;) 


is normally illuminated by a unit-amplitude, monochromatic plane wave. 
Show that the screen acts as a lens with multiple focal lengths. Specify the 
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size of these focal lengths and the relative amounts of energy brought to a focus 
in the corresponding focal planes. (A diffracting screen such as this is known 
as a Fresnel zone plate.) 

HINT: The square wave shown in Fig. P5-10 can be represented by the Fourier 


series 
. sin (1n/2) _ 2nnx 
f(z) = ». ais oa EN Sire 
TN xX 
n=—0 
Ge), 
ae ll 
x 
2 
x 
Figure P5-10 
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6 FREQUENCY ANALYSIS OF 
OPTICAL IMAGING SYSTEMS 


Considering the long and rich history of optics, the tools of frequency 
analysis and linear systems theory have played important roles for only 
a relative short period of time. Nevertheless, in this short time these 
tools have been so widely and successfully employed that they now occupy 
a fundamental place in the theory of imaging systems. 

A realization of the utility of Fourier methods in the analysis of 
optical systems arose rather spontaneously in the late 1930s when a 
number of workers began to advocate the use of sinusoidal test patterns 
for system evaluation. Much of the initial stimulus was supplied by a 
French scientist, P. M. Duffieux, whose work culminated in the publica- 
tion of a book, in 1946, on the use of Fourier methods in optics [Ref. 6-1]. 
Unfortunately, this book has never been translated into English and is 
not widely available. In the United States, much of the interest in these 
topics was stimulated by an electrical engineer named Otto Schade, who 
very successfully employed the methods of linear systems theory and 
communication theory in the analysis and improvement of television 
camera lenses [Ref. 6-2]. However, the foundations of Fourier optics 
were in fact laid considerably earlier than 1940, particularly in the works 
of Ernst Abbe (1840-1905) and Lord Rayleigh (1842-1919). 

In this chapter, we shall consider the role of Fourier analysis in the 
theory of coherent and incoherent imaging. While historically the case 
of incoherent illumination has been the more important one, nonetheless 
the case of coherent illumination has always been important in micro- 
scope imagery and has gained additional importance since the advent of 
the laser. For further discussions of various aspects of the subject matter 
to follow, the reader may consult the books by O’ Neill [Ref. 6-3], Frangon 
[Ref. 6-4], Linfoot [Ref. 6-5], and Stroke [Ref. 6-6]. 
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Figure 6-1 Generalized model of an imaging system. 
6-1 GENERALIZED TREATMENT OF IMAGING SYSTEMS 


In the preceding chapter, the imaging properties of a single thin con- 
verging lens were studied for the particular case of monochromatic 
illumination. In the material to follow, we shall first broaden our dis- 
cussion beyond a single converging lens, finding results applicable to 
more general systems of lenses, and then remove the restriction of mono- 
chromatic illumination, obtaining results for ‘‘quasi-monochromatic”’ 
light, both spatially coherent and spatially incoherent. 


A generalized model 


Suppose that an imaging system of interest is composed, not of a single 
lens, but of several lenses, perhaps some positive and some negative. 
The lenses need not be “‘thin’”’ in the sense defined earlier. We shall 
assume, however, that the system ultimately produces a real image; 
this is actually not a restriction, for if a virtual image is to be viewed, 
it must eventually be converted to a real image, perhaps by the lens of the 
eye. We would, then, include this lens as the final element of our system 
in such a case. 

To specify the properties of a lens system, we adopt the point of view 
that all the imaging elements may be lumped into a single ‘‘black box,” 
and that the significant properties of the system can be completely 
described by specifying only the terminal properties of the aggregate. 

Referring to Fig. 6-1, the ‘terminals’ of this black box consist of 
an entrance pupil, representing a finite aperture (effective or real) through 
which light must pass to reach the imaging elements, and an exit pupil 
(again effective or real), representing a finite aperture through which 
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light must pass as it leaves the imaging elements on its way to the image 
plane. It is usually assumed that passage of light between the entrance 
and exit planes can be adequately described by geometrical optics. Thus 
the finite extent of the two pupils is found by geometrically projecting 
the smallest aperture of the system through the imaging elements onto 
the entrance and exit planes. The sizes of the resulting pupils may be 
smaller than the actual physical sizes of the apertures existing in the 
two planes, for they may correspond to the projection of an effectively 
smaller aperture that exists somewhere within the system. Note that by 
these definitions the entrance pupil is always a geometrical projection 
of the exit pupil, and vice versa. 

An imaging system is said to be diffraction limited if a diverging 
spherical wave, emanating from any point-source object, is converted 
by the system into a new wave, again perfectly spherical, that converges 
toward an ideal point in the image plane. Thus the terminal property 
of a diffraction-limited lens system is that a diverging spherical wave 
incident on the entrance pupil is mapped into a converging spherical wave 
at the exit pupil. For any real imaging system, this property will, at best, 
be satisfied over only a finite region of the object plane. If the object of 
interest is confined to this region, the system may be regarded as being 
diffraction limited. 

If, in the presence of a point-source object, the wavefront leaving 
the exit pupil departs significantly from ideal spherical shape, then the 
imaging system is said to have aberrations. Aberrations will be considered 
at greater length in Sec. 6-4, where it will be shown that they lead to 
defects in the spatial-frequency response of the imaging system. 


Diffraction effects 


Since geometrical optics adequately describes the passage of light from 
entrance pupil to exit pupil, diffraction effects play a significant role only 
during passage of light from the object to the entrance pupil and from 
the exit pupil to the image. It is, in fact, possible to associate all diffraction 
limitations with either of the two portions of the propagation path. The 
two points of view that regard image resolution as being limited by (1) 
the finite entrance pupil, or (2) the finite exit pupil are entirely equivalent. 
The fundamental reason for this equivalence lies in the fact that one pupil 
is simply a geometrical projection of the other. 

The view that regards diffraction effects as resulting from the finite 
entrance pupil was first espoused by Ernst Abbe in 1873 [Ref. 6-7]. 
According to the Abbe theory, only a certain portion of the diffraction 
components generated by a complicated object are intercepted by the 


104 / INTRODUCTION TO FOURIER OPTICS 


We 


Image 


Diffracted 
orders 


Figure 6-2 The Abbe theory of image formation. 


finite entrance pupil. The components not intercepted by the aperture 
are precisely those generated by the high-frequency portions of the object. 
This viewpoint is illustrated in Fig. 6-2 for the case of an object that is a 
simple grating and an imaging system that is composed of a single positive 
lens. 

A view essentially equivalent to regarding diffraction effects as 
resulting from the finite erzt pupil was presented by Lord Rayleigh in 
1896 [Ref. 6-8]. This is the viewpoint that was used in Sec. 5-3, and we 
shall again adopt it here. 

Again the image amplitude! is represented by a superposition integral 


Us(xiys) = ff W(ciyrstesye)UoleterYe) dito dy (6-1) 


where h is the amplitude at image coordinates (z:,y:) in response to a 
point-source object at (x.,y.). In the absence of aberrations, the response 
h arises from a spherical wave (of limited extent) converging from the exit 
pupil toward an ideal image point (7; = Ma, y; = My,.). In this more 
general case we allow the magnification WM to be either negative or positive, 
according to whether the image is or is not inverted. 

From the result of Prob. 4-3 and the discussions of Sec. 5-3, the 


1 We have retained the assumption of monochromatic illumination but will 
remove it in the section to follow. 
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distribution of light amplitude about the ideal image point is simply the 


Fraunhofer diffraction pattern of the exit pupil, centered on coordinates 
(Mx.,My.). Thus 


hcsueieay) = K ff Plow) 


_ 21 
exp |-0 | 


(x; — Mz.)x + (yi — My.)y]} dx dy (6-2) 
where K is a complex constant and the pupil function P is unity inside 
the pupil and zero outside the pupil. In writing this equation, we have 
again neglected the quadratic phase factors over the object and image 
planes, as justified in Sec. 5-3. With the changes of variables 


x y 
dd; 


— 


i=Mz 7% = My 
we obtain 
h(x; — To. (Seas Go) = Ki\2d? If P(\d€,d.:f/) 


exp {—j2x|(2i = £.)f + (ys — Go) 9)} dédy (6-3) 
With the final definitions 


ae 1 
~ KA2d2 
7d? Zo Yo 
U Lo; Yo = K U, ee ane. 
o(o,Go) M? G a) 
the superposition integral (6-1) becomes 
Uicwsys) = ff Ales — a, ys — Goof) dt dj (6-4) 


where U, is the ideal image predicted by geometrical optics and 


h(z:,y:) = if P(Ad:€,\d:4) exp [—j2r(aié + y.G)] dé dy (6-5) 
Thus in this more general case we may again regard the image as a 
convolution of the image predicted by geometrical optics with an impulse 
response that is determined by the exit pupil of the system. 
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Nonmonochromatic illumination: coherent and incoherent cases 


The assumption of strictly monochromatic illumination has been present 
in all our discussions of imaging systems. This assumption is overly 
restrictive, for the illumination generated by real sources (including lasers) 
is never perfectly monochromatic. The statistical nature of the time 
variations of illumination amplitude and phase can, in fact, influence the 
behavior of an imaging system in very profound ways. We therefore 
digress temporarily to consider the very important effects of nonmono- 
chromaticity. 

To treat this subject in a completely satisfactory way, it would be 
necessary to take a rather long detour through the theory of partial 
coherence. However, for our purposes such a detour would not be practical. 
We therefore treat the subject from two points of view, one entirely 
heuristic, and the second more rigorous but not entirely general. The 
reader interested in a more complete treatment should consult the book 
by Beran and Parrent [Ref. 6-9]. 

In the case of monochromatic illumination, it was convenient to 
represent the field amplitude by a complex phasor U that was a function 
of space coordinates. When the illumination is nonmonochromatic, this 
approach can be generalized by representing the fields by time-varying 
phasors that depend on both time and space coordinates. When the 
illumination is quasi-monochromatic (i.e., narrowband), the amplitude 
and phase of the time-varying phasor are readily identified with the 
envelope and phase of the optical wave. However the concept of a time- 
varying phasor can be generalized to apply to waves that are not quasi- 
monochromatic [e.g., Ref. 6-9, Chap. 2]. 

Consider the nature of the light transmitted or reflected by an object 
that is illuminated by a nonmonochromatic wave. Since the time varia- 
tions of the phasor amplitude are statistical in nature, only statistical 
concepts can provide a satisfactory description of the field. As we have 
seen previously, each object point generates an impulse response in the 
image plane. If the amplitude and phase of the light at an object point 
vary randomly with time, then the overall amplitude and phase of the 
impulse response will vary in a corresponding fashion. Thus the statistical 
relationships between the phasor amplitudes at various points on the 
object will influence the statistical relationships between the correspond- 
ing impulse responses in the image plane. These statistical relationships 
will greatly affect the result of the time-averaging operation that yields 
the final image intensity distribution. 

We shall consider only two types of object illumination here. First, 
we consider object illumination with the particular property that the 
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phasor amplitudes at all object points vary in unison; thus while any two 
object points may have different fixed relative phases, their absolute 
phases are varying with time in identical fashions. Such illumination is 
called spatially coherent. Second, we consider object illumination with 
the opposite property that the phasor amplitudes at all points in the 
object plane are varying in statistically independent or unrelated fashions. 
Such illumination is called spatially incoherent. (In the future, we shall 
refer to these types of illumination simply as coherent and incoherent.) 
Coherent illumination is obtained whenever the light appears to originate 
at a single point.! The most popular example of a source of such light is a 
laser, although more conventional sources (e.g., zirconium are lamps) 
can yield coherent light (albeit of weaker intensity than that of a laser) 
if their output is first passed through a pinhole. Incoherent light is 
obtained from diffuse or extended sources, such as gas discharges. ? 

When the object illumination is coherent, the various impulse 
responses in the image plane vary in unison and therefore must be added 
on a complex-amplitude basis. Thus a coherent imaging system is linear 
in complex amplitude. The results of our monochromatic analysis can 
therefore be directly applied, with the understanding that the complex 
amplitude U is now a time-invariant phasor amplitude that describes the 
relative amplitudes and phases of the light. 

When the object illumination is incoherent, the impulse responses in 
the image plane vary in statistically independent fashions. They must 
therefore be added on a power or intensity basis. Since the intensity of any 
given impulse response is proportional to the intensity of the object 
point that gave rise to it, it follows that an incoherent system is a linear 
mapping of intensity, and that the impulse response of the intensity 
mapping is proportional to the squared modulus of the coherent impulse 
response. 

The preceding arguments have been entirely heuristic, and in fact 
have certain assumptions and approximations buried in them. We there- 
fore turn to a more rigorous examination of the problem. To begin, note 
that in the monochromatic case we obtain the phasor representation by 
suppressing the positive-frequency component of the cosinusoidal field 


1 This is a sufficient but not a necessary condition. For example, when light from 
a point source is passed through a stationary diffuser, the relative phase of the light 
at any two points behind the diffuser remains time-independent. Therefore the trans- 
mitted light is still spatially coherent, even though it no longer appears to originate 
from a single point source. 

2 Alternative definitions of coherence in terms of quantum field theory are also 
possible [see, for example, Refs. 6-10, 6-11], but for our purposes the simplified con- 
cepts described above will suffice. 


108 / INTRODUCTION TO FOURIER OPTICS 


variation and assigning a complex amplitude to the remaining negative- 
frequency component. To generalize this concept to the nonmono- 
chromatic wave u(P,t), we suppress all positive-frequency components of 
its Fourier spectrum, yielding a new (complex) function u_(P,t). If we 


further write 
u_(P,t) = U(P,t) exp (—j27it) 


where 7 is the mean frequency of the disturbance, then the complex 
function U(P,t) may be regarded as the time-varying phasor representa- 
tion of u(P,t). 

The reader is now referred to the results of Prob. 3-5, from which it 
can be seen that, when two particular conditions are satisfied, the phasor 
U(P,t) obeys the same propagation laws as the time-independent phasor 
of a monochromatic disturbance at frequency ». The two required 
conditions are 


Av ; : . 
= KE II (the quasi-monochromatic assumption) 
Vv 


1 d 
and raptoete 
a Ap © 
where Av is the bandwidth of the disturbance, d is the length of the longest 
optical path involved, and c is the velocity of light. Under these two 
conditions, it follows directly that Eq. (6-4) may be used to represent 
the phasor amplitude of the image by the convolution equation 


Uslesysst) = ff Ale — fo, ys — Ge)Uo(orfost) €2, dy. (6-6) 


where U, is the phasor amplitude of the image predicted by geometrical 
optics, and h is given by Eq. (6-5) with X replaced by the mean 
wavelength X. 

It is, of course, the intensity 


Ti(xiyi) = (Us(ri, yet) UF (x,y: 5t)) (6-7) 


in the image plane that is of ultimate concern.! To find this quantity, 
we substitute (6-6) in (6-7) and interchange orders of integration and 
averaging, giving 


T(2xi,y:) = if de dYo lf dk, dijoh(x: = Lo, Up ee Yo) 
h* (x; Syl Yi — Yo) (Ug (Fo, Go jt) U* (%.,Ho5t)) (6-8) 


1 Again the angle brackets represent an infinite-time average. 
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Since the statistical properties of U, are identical with those of U,, the 
final quantity in angle brackets can be seen to depend on the spatial 
coherence of the object illumination. 

When the illumination is perfectly coherent, the phasor amplitudes 


across the object plane differ only by complex constants. Equivalently 
we may write 


U,(0,0;t) 
(|U,(0,0;t) |?) 
U,(0,0;t) 
(\U,(0,0;t)|*)” 


Ure ves!) = UT Seis) 
(6-9) 
UO (Go,4o3t) = U,(E,, Ho) 


where the time-independent U, are phasor amplitudes relative to the 
time-varying phasor amplitude at the origin. Substituting these relations 
in (6-8), the space integrations are found to separate, yielding 


Ti(eays) =| [f Ales — a0, ys — Go)Up(to,Ge) dts ago} (6-10) 


Finally, defining U;(z;,y:) as the phasor amplitude of U;(z,,y;;t) relative to 
the amplitude of U, at the origin, the coherent imaging system is found 
to be described by the convolution equation 


U;(2i,yi) = J ip A(x — €, Yi — Go) Uy (Fo,Ho) dE. dG (6-11) 


We conclude, then, that coherent object illumination yields an imaging 
system that is linear in complex amplitude. 

When the object illumination is perfectly incoherent, the phasor 
amplitudes across the object vary in statistically independent fashions. 
This idealized property may be represented by the equation 


(U5 Ge, Gost) US Het)? = KL, (Bo, fo) 8(2o 7 Bo, Yo ro Yo) (6-12) 


where «x is a real constant. Such a representation is not exact, however, 
for it can be shown to imply that the only waves transmitted by the object 
are evanescent. In actuality, the minimum distance over which coherence 
can exist is of the order of one wavelength. (See Ref. 6-9, sec. 4.4, for 
more details.) Nonetheless, for most practical purposes (6-12) is correct. 
When used in (6-8) the result 


Ii(xi,yi) = k If |h(zi — Bo, Yi — Go) |?L4(£o,Ho) dE. dijo (6-18) 
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is obtained. Thus the image intensity is found as a convolution of the 
ideal intensity 7, with an impulse response lhl? We conclude that when 
the object illumination is incoherent, the imaging system must be treated 
as a linear mapping of intensity. Furthermore, the impulse response of 
this intensity mapping is simply proportional to the squared modulus of 
the impulse response obtained with coherent illumination. 


6-2. FREQUENCY RESPONSE OF A DIFFRACTION-LIMITED 
COHERENT IMAGING SYSTEM 


We turn now to the more germane subject matter of this chapter, the 
frequency analysis of imaging systems. For the present, attention will be 
restricted to imaging systems with coherent illumination. Systems with 
incoherent illumination are treated in Sec. 6-3. 

As emphasized previously, a coherent imaging system is linear in 
complex field amplitude. This implies, of course, that such a system 
provides a highly nonlinear intensity mapping. If frequency analysis 
concepts are to be applied in their usual form, they must be applied to 
the linear amplitude mapping. 


The coherent transfer function 


Our analysis of coherent systems has yielded a space-invariant form of the 
amplitude mapping, as evidenced by the convolution equation 


Us(eiys) = ff Res = a, yx — GU p(,G.) €2 dj. (6-14) 


We would anticipate, then, that transfer-function concepts can be 
directly applied to this system. To do so, we define the following frequency 
spectra of the system input and output, respectively: 


Gi(fxfr) = ff Up(to,d.) exp [—j2n(fut, + frG.)] diy dy 
<a (6-15) 
Gi(fx,fy) = If Ui(xi,yi) exp [—j2r(fxa:i + fry:)] dx; dy; 


In addition, we define the transfer function as the Fourier transform of 
the space-invariant impulse response, 


Hfxfy) = ff Rasy) exp [—72e(faxs + fry] dei dys (6-16) 
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Now applying the convolution theorem to (6-14), it follows directly that 


Gilfx,fy) = H(fx,fy)G,(fx,fy) (6-17) 


Thus the effects of the diffraction-limited imaging system have been 
expressed, at least formally, in the frequency domain. In the future, the 
function H(fx,fy) will be referred to as the coherent transfer function. It 
remains now to relate H more directly to a physical property of the 
imaging system itself. 

To this end, note that while Eq. (6-16) defines H as the Fourier 
transform of h, this latter function is itself given by the Fourier transform 
relation [cf. Eq. (6-5)] 


(eam) = ff Plrdit,rdg) exp [—P2e(est + yeg)] at ag 


We conclude, then, that for a diffraction-limited system, 
H(fx,fr) = F{F{PAdz,rAd.¥) } } = P(—Adifx, —dd.fy) (6-18) 


This relation is of the utmost importance; it supplies very revealing 
information about the behavior of diffraction-limited coherent systems 
in the frequency domain. Since the pupil function P is always either unity 
or zero, the same is true of the coherent transfer function. This implies, 
of course, that there exists a finite passband in the frequency domain 
within which the diffraction-limited system passes all frequency com- 
ponents without amplitude or phase distortion.! At the boundary of this 
passband, the frequency response suddenly drops to zero, implying that 
frequency components outside the passband are completely attenuated. 

Finally, note that the coherent transfer function is proportional to 
the reflected pupil function, as implied by the minus signs in the argument 
of P. We can remove this notational awkwardness by hereafter defining 
P in a reflected coordinate system, rather than in the conventional coor- 
dinate system of Fig. 6-1. Thus we write 


H(fx,fy) = PQdfx,Adify) (6-19) 


The distinction will not be important to us here, for the problems to be 
considered are all symmetrical in the fx and fy coordinates. 


1 Note that this conclusion has been drawn only for a system free from aberra- 
tions. As we shall see in Sec. 6-4, an aberrated system is not distortion-free within 
its passband. 
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(a) (b) 


Figure 6-3 Coherent transfer functions for diffraction-limited systems with (a) 
square and (b) circular exit pupils. 


Examples of coherent transfer functions for diffraction-limited systems 


To illustrate the frequency response of diffraction-limited coherent 
imaging systems, consider the coherent transfer functions of systems with 
square (width /) and circular (diameter /) exit pupils. For these two cases 
we have, respectively, 


Ptakyy ="reet (5) rect (“) 
“del pokek oe 28 
P(z,y) = cire Ses 
2 


Thus, from (6-19), the corresponding coherent transfer functions are 


H(fx,fy) = rect (“) rect (a) 


Vie + =) 


yO ie ( 1/2nd, 


These functions are illustrated in Fig. 6-3. Note that a cutoff frequency 
fo can be defined in both cases by 


fo =e (6-20) 


where in the circular case this cutoff is uniform in all directions in the 
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frequency plane, while in the square case this cutoff applies only along 
the fx and fy axes. To illustrate a typical order of magnitude of f,, suppose 


that 1 = 2 cm, d; = 10 cm, and \ = 10-4 cm. Then the cutoff frequency 
is 100 cycles/mm. 


6-3 FREQUENCY RESPONSE OF A DIFFRACTION-LIMITED 
INCOHERENT IMAGING SYSTEM 


The relationship between the exit pupil and the coherent transfer function 
has been seen to be a very direct and simple one. When the object 
illumination is incoherent, the transfer function of the imaging system 
will be seen to be determined again by the exit pupil, but in a less direct 
and somewhat more interesting way. The theory of incoherent imaging 
has, therefore, a certain extra richness not present in the coherent case. 
We turn now to considering this theory; again attention will be centered 
on diffraction-limited systems, although the discussion that immediately 
follows applies to all incoherent systems regardless of their aberrations. 


The optical transfer function and its relation to the coherent transfer function 


Imaging systems that use incoherent illumination have been seen to obey 
the intensity convolution integral 


Ti(assys) = « ff \Rlws — ao, ys — Ge)|*To(do,Go) Af Ajo (6-21) 


Such systems should therefore be frequency-analyzed as linear mappings 
of intensity distributions. To this end, let the normalized frequency 
spectra of J, and J; be defined by 


co 


[ f Faldo.) exp [—j2e (fete + fre)] de die 
G(fzfr) == 7 (6-22) 
[f ToGo) dt. age 


i] 


ff T(ai,yi) exp [—j2a(fxxi + fryi)] dx; dy; 
Gi(fx.fy) = — = (6-23) 
lf I ;(x:,yi) dx; dy; 


The normalization of the spectra by their ‘zero frequency” values is 
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partly for mathematical convenience and partly for a more fundamental 
reason. Since intensity is a nonnegative quantity, an intensity distri- 
bution always has a nonzero “de component,” or constant background. 
The visual quality of an image is to a large extent dependent on the 
“contrast,” or the relative intensities of the information-bearing portions 
of the image and the ever-present background. Hence the spectra are 
normalized by that background. 

In a similar fashion, the normalized transfer function of the system 
can be defined by 


[f |h(x:,ys)|? exp [—j2a(fxx: + fry:)] dz: dy: 
xa sy) = — = (6-24) 
it |h(x.,ys)|? da: dy: 


Application of the convolution theorem to Eq. (6-21) then yields the 
frequency-domain relation 


Gi(fx.fy) = I(fx,fr)Go(fx,fy) (6-25) 


The function 3 is commonly known as the optical transfer function 
(abbreviated OTF) of the system. Its modulus || is known as the 
modulation transfer function (MTF). Note that #(fx,fy) simply specifies 
the complex weighting factor applied by the system to the frequency 
component at (fx,fy) relative to the weighting factor applied to the zero- 
frequency component. 

Since the definitions of both the coherent transfer function and the 
optical transfer function involve the function h, we might expect some 
specific relationship between the two. In fact, such a relationship exists 
and can readily be found with the help of the autocorrelation theorem of 
Chap. 2. Since 


H(fx,fy) = a : 
and KH(fx, fy) = 


it follows directly that 


[J He aDHe — fe, af — fe) ae af 
se(fx,fy) = — : C= 
Jf tHe a 2 ae! an’ 


FREQUENCY ANALYSIS OF OPTICAL IMAGING SYSTEMS / 115 


The simple change of variables 


results in the symmetrical expression 


ff (« +& n+o)H *(e-n-B) ara 


KH(fx,fy) = (6-27) 
fl |H(é,n)|? dé dy 


Equation (6-27) will serve as our primary link between the properties 
of coherent and incoherent systems. Note that it is entirely valid for 
systems both with and without aberrations. 


General properties of the OTF 


A number of very simple and general properties of the OTF can be 
derived directly from Eq. (6-27). The most important properties are 
listed below: 


ie oc Oe 
2. (—jx,—fr) = H*(fz,Jy) 
3. |3(fx,fy)| S |3¢(0,0)| 


Property 1 follows directly by substitution of (fx = 0, fy = 0) in Eq. 
(6-27). The proof of property 2 is left as an exercise for the reader, it 
being no more than a statement that the Fourier transform of a real 
function is hermitian. 

The proof that the MTF at any frequency is always less than its 
zero-frequency value of unity requires more effort. To prove property 3, 
we use Schwarz’ inequality [Ref. 6-12], which can be stated as follows: 
If X(é,n) and Y(é,n) are any two complex-valued functions of (&,n), then 


| [[ xvdgdnP s ff ixtaedn ff [veagdn (6-28) 


with equality if and only if Y = KX*, where K is a complex constant. 
Letting 


xen) =H (¢+20+2) and Vien) = Ht(e 29-2) 
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Ipa(erpet a) (tga g)ae 
s [f|n(e+ S44) 
[ ff iacemle ag anf 


— eo 


ll 


Normalizing by the right-hand side of the inequality, it follows that 
|3¢(fx,fy)| is never greater than unity. 

Finally, it should be pointed out that while the OTF is always unity 
at zero frequency, this does not imply that the absolute intensity level 
of the image background is the same as the absolute intensity level of 
the object background. In fact, the absolute intensity level is always 
decreased due to the finite entrance aperture of the system, but such a 
loss is not evident because of the particular normalization of the OTF. 


The OTF of an aberration-free system 


To this point, our discussions have been equally applicable to systems 
with and without aberrations. We now consider the special case of a 
diffraction-limited incoherent system. Recall that for the coherent 
system, we have 


H(fx,fy) = P(difx,Ad.fy) 


For the incoherent system, it follows from Eq. (6-27) (with a simple 
change of variables) that 


(fx, fy) 
y, difx ay aj X tJ ¥ 
[fe (en SS) ee ace 


2 2 2 


“a 


: (6-29) 
[[ Pn dean 


where, in the denominator, the fact that P equals either unity or zero 
has been used to replace P? by P. 

The expression (6-29) for 3¢ lends itself to an extremely important 
geometrical interpretation. The numerator represents the area of overlap 
of two displaced pupil functions, one centered at (Ad:fx/2, \d:fy/2) and 
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> ~— Adil fx | 


Area of 
overlap 


(a) (b) 
Figure 6-4 Geometrical interpretation of the OTF of a diffraction-limited system. 
(a) The pupil function—total area is the denominator of the OTF; 


(b) two displaced pupil functions—the shaded area is the numerator 
of the OTF. 


the second centered at the diametrically opposite point (—)d,fx/2, 
—ddify/2). The denominator simply normalizes this area of overlap by 
the total area of the pupil. Thus 


area of overlap 


K(fx,fy) = 


total area 


To calculate the OTF of a diffraction-limited system, the steps indicated 
by this interpretation can be directly performed, as illustrated in Fig. 
6-4. For simple geometrical shapes, closed-form expressions for the nor- 
malized area of overlap can be found (see examples to follow). For very 
complicated pupils, values of the OTF at a number of discrete frequencies 
can be calculated with the help of a planimeter or a digital computer. 

Finally, note that this geometrical interpretation demonstrates that 
the OTF of a diffraction-limited system is always real and nonnegative. 
It is not necessarily a monotonically decreasing function of frequency, 
however (see, for example, Prob. 6-1). 


Examples of diffraction-limited OTFs 


We consider now the OTFs that correspond to diffraction-limited systems 
with square (width /) and circular (diameter /) exit pupils. Figure 6-5a 
illustrates the calculation for the square case. The area of overlap is 
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L— ddilfx| 


ES 


1— rdilfy| 


ddi|fy| 


(Expanded view) 
Adil fx| 


(a) 


Figure 6-5 OTF of a diffraction-limited system with a square exit pupil. 
(a) Calculation of the area of overlap; (b) the resulting OTF. 


evidently 


Q@(fx,fy) = lfy| S l/dd; 


otherwise 


" — rdilfx|)(@ — ddilfr]) lfx| < U/dd; 


When this area is normalized by the total area /?, the result becomes 


re (=) A (=) (6-30) 


where A is the triangle function of Chap. 2, and f, is the cutoff of the same 
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system when used with coherent illumination 


l 
2dd; 


ie= 
Note that the frequency cutoff of the incoherent system occurs at fre- 


quencies 2f, along the fx and fy axes.! The OTF represented by Eq. (6-30) 
is illustrated in Fig. 6-5. 


When the exit pupil is circular, the calculation is not quite so straight- 


1 This should not be taken to imply that the incoherent system has twice the 
resolving power of the coherent system. See Sec. 6-5. 


ea 
es za 


(a) 


(b) 


Figure 6-6 OTF of a diffraction-limited system with a circular exit pupil. 
(a) Calculation of the area of overlap; (b) the resulting OTF. 
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forward. Since the OTF will clearly be circularly symmetric, it suffices to 
calculate 3¢ along the positive fx axis. As illustrated in Fig. 6-6a, the area 
of overlap may be regarded as being equal to four times the shaded area 
B of the circular sector A + B. But the area of the circular sector 1s 


Area (A + B) = | E (5) | A fee Sis) . ( y] 


while the area of the triangle A is 


tx 2 ) d; x 2 
smn“) 


Finally, we have 


4[area (A + B) — area (A)] 
(1/2)? 


or, for a general radial displacement p in the frequency plane, 


2 p p p i 
= | cos! a 2 < 2p 
5(p) eas eo ) 2 po : ea gag (6-31) 


0 otherwise 


K(fx,0) ae 


The quantity p. is the cutoff of the coherent system, 


ee! 
mea 


Po 


Referring to Fig. 6-6b, the OTF is again seen to extend to a frequency 
that is twice the coherent cutoff frequency. 


6-4 ABERRATIONS AND THEIR EFFECTS ON FREQUENCY RESPONSE 


In the development of a generalized model of an imaging system, it was 
specifically assumed that the presence of a point-source object yielded 
at the exit pupil a perfect spherical wave, converging toward the ideal 
geometrical image point. Such a system was called diffraction limited. 
We consider now the effects of aberrations, or departures of the exit-pupil 
wavefront from ideal spherical form. Aberrations can arise in a variety of 
ways, ranging from a defect as simple as a focusing error to inherent 
properties of perfectly spherical lenses, such as spherical aberration. 
A complete treatment of aberrations and their detailed effects on fre- 
quency response is beyond the scope of this development. Rather, we 
concentrate on very general effects and illustrate with one relatively 


FREQUENCY ANALYSIS OF OPTICAL IMAGING SYSTEMS / 121 


simple example. For a more complete treatment of various types of 


aberrations and their effects on frequency response, see, for example, 
Ref. 3-4, chap. IX. 


The generalized pupil function 


When an imaging system is diffraction limited, the (coherent) impulse 
response has been seen to consist of the Fraunhofer diffraction pattern of 
the exit aperture, centered about the ideal image point. This fact suggests 
a convenient artifice which will allow aberrations to be directly included 
in our previous results. Specifically, when wavefront errors exist, we can 
imagine that the exit pupil zs illuminated by an ideal spherical wave but 
that a phase-shifting plate exists within the aperture, thus deforming the 
wavefront that leaves the aperture. If the phase error at the point (x,y) 
in the exit pupil is represented by kW(z,y), where k = 27/\ and W is an 
effective path-length error, then the complex transmittance P of the 
imaginary phase-shifting plate is given by 


P(z,y) = P(a,y) exp [jkW(z,y)] (6-32) 


The complex function P may be referred to as the generalized pupil 
function. The impulse response of an aberrated coherent system is simply 
the Fraunhofer diffraction pattern of an aperture with transmittance P. 
The impulse response of an aberrated incoherent system is again, of 
course, the squared modulus of the coherent result. 


Effects of aberrations on the coherent transfer function 


When considering a diffraction-limited coherent system, the transfer 
function was found by noting that (1) the impulse response is the Fourier 
transform of the pupil function, and (2) the coherent transfer function 
is the Fourier transform of the impulse response. As a consequence of the 
two Fourier transform relations, the transfer function was found to be 
proportional to the pupil function P. Identical reasoning can be used 
when aberrations are present, provided the generalized pupil function P 
replaces P. Thus the coherent transfer function is written 


H(fx,fy) = P(Adifx,Adify) 
= P(dd,fx,Adify) exp [JkW (Adifx,Adify)] (6-33) 


Evidently the band limitation of the coherent transfer function, as 
imposed by the finite extent of the exit aperture, is unaffected by the 
presence of aberrations. The sole effect of aberrations is seen to be the 
introduction of phase distortions within the passband. Phase distortions 
can, of course, have a severe effect on the fidelity of the system. 


ll 
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There is little more of a general nature that can be said about the 
effects of wavefront aberrations on a coherent system. Again the result 
is a very simple one; as we shall now see, the result for an incoherent 
system is again more complex and, in many respects, more interesting. 


Effects of aberrations on the OTF 


Having found the effects of aberrations on the coherent transfer function, 
it is now possible, with the help of Eq. (6-27), to find the effects on the 
optical transfer function. To simplify the notation, the function @(fx,fy) 
is defined as the area of overlap of 


fe (« oe wes Vice ‘ee and P( ag & n+ ‘ete 


Thus the OTF of a diffraction-limited system is given, in this new 
notation, by 


‘i dé dn 
5(fx, fy) = a(fx, fy) 


When aberrations are present, substitution of (6-33) in (6-27) yields 


H( fx, fr) 
exp {ak | (+5, +*oh) 


2 
@(fx, fy) 


a(0;0) 
This expression allows us, then, to directly relate the wavefront errors 
and the OTF. 

As an important general property, it can be shown that aberrations 
will never increase the MTF (the modulus of the OTF). To prove this 


property, Schwarz’ inequality, Eq. (6-28), will be used. Let the functions 
X and Y of that equation be defined by 


exp | kW (¢ + MAU, 4 4 Mi ‘)) 


» 


“= a 


Y(é,n) = exp | sew (« ef, eas age ==) 


X(£,n) 


ll 


wy 


r4) a 
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Noting that |X|? = |Y|? = 1, it follows that 


\3e(fx,fy) [Feith aberrations 


: Nd fx dd; 2 
exp {ik |W (« + — ian af) 
@(fx,f¥) - y, 
Adi fx Ad; 
pid (: a - “~? coll cas 
_ Dy ye 
/ dé dn 
a(0,0) 
dt dn | 
(fx, fy) 2 
< —— = [3e(Fx, fy) |eithout aberrations 
[ adn 
a@(0,0) 


Thus aberrations will in general lower the contrast of each spatial- 
frequency component of image intensity. The absolute cutoff frequency 
remains unchanged, but severe aberrations will reduce the high-frequency 
portions of the OTF to such an extent that the effective cutoff is much 
lower than the diffraction-limited cutoff. In addition, aberrations can 
cause the OTF to have negative values in certain bands of frequencies, a 
result that never occurs for an aberration-free system. When the OTF is 
negative, image components at that frequency undergo a contrast 
reversal; i.e., intensity peaks become intensity nulls, and vice versa. An 
example of this effect will be seen in the section that follows. 


Example of a simple aberration: a focusing error 


One of the easiest aberrations to deal with mathematically is a simple 
error of focus. But even in this simple case, the assumption of a square 
aperture (rather than a circular aperture, of more practical interest) is 
needed to keep the mathematics simple. 

As illustrated in Sec. 5-3, the condition of focus is determined by the 
lens law 


- 1 1 
Cp iit ata Oh 
When the image plane is out of focus, the more general relation 
ae 
d; o 
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will hold. Comparing Eqs. (5-30) and (6-33), the aberration function W 
is seen to be 


(6-35) 


For a square aperture of width J, the maximum phase error at the 
edge of the aperture (along the z or y axis) is ekl?/8. Thus the maximum 
path-length error, which we represent by w, is given by 


él? 


i. (6-36) 


w= 


The number w is a convenient indication of the severity of the focusing 
error. 

If the aberration function W given by (6-35) is substituted in the 
expression (6-34) for the OTF, a number of straightforward manipulations 
yield the result 


CW eavure 
seine) = a(37)a(Fr) 


[EN 2] om O- ED] em 


A cross section of this OTF is shown in Fig. 6-7. Note that the diffraction- 
limited OTF is indeed obtained when w = 0. Note also that for values 
of w greater than one-half, sign reversals of the OTF occur. These reversals 
of contrast can readily be observed if the “spoke” target of Fig. 6-8a is 
used as the object. When the system is out of focus, a gradual attenuation 
of contrast and a number of contrast reversals are obtained for increasing 
spatial frequency (i.e., decreasing radius), as illustrated in Fig. 6-80. 

Finally, consider the form of the OTF when the focusing error is very 
severe (that is, w >>). In such a case, the frequency response drops 
toward zero for relatively small values of fx/2f. and fy/2f,. We may 
therefore write 


ee ie 


2fo fame 
and the OTF reduces to 


: 8 ) Xx 5 8 Y 
5(fx, fy) & sine E (=) sine E (=)| (6-38) 


The interested reader may verify (see Prob. 6-6) that this is precisely 
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H 


Figure 6-7 Cross section of the OTF for a focusing error and a 
square pupil. 


the OTF predicted by geometrical optics! More generally, when aberra- 
tions of any sort are very severe, the geometrical-optics predictions of the 
impulse response may be Fourier transformed to yield a good approxima- 
tion to the OTF of the system. 


6-5 COMPARISON OF COHERENT AND INCOHERENT IMAGING 


As seen in Secs. 6-2 and 6-3, the OTF of a diffraction-limited system 
extends to a frequency that is twice the cutoff of the coherent transfer 
function. It is tempting, therefore, to conclude that incoherent illumina- 
tion will invariably yield ‘‘better” images than coherent illumination, 
given that the same imaging system is used in both cases. As we shall now 
see, this conclusion is in general not a valid one; a comparison of the two 
types of illumination is far more complex than such a superficial examina- 
tion would indicate. 

A major flaw in the above argument lies in the direct comparison of 
the cutoff frequencies in the two cases. Actually, the two are not directly 
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Figure 6-8 (a) Focused and (b) misfocused images of a spoke target. 
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comparable. The coherent cutoff determines the maximum frequency 
component of the image amplitude, while the incoherent cutoff refers to 
frequency components of the image intensity; yet in both cases the final 
observable quantity is intensity. Surely any direct comparison of the two 
systems must be in terms of the same observable quantity, image intensity. 

Even when the quantity to be compared is agreed upon, the com- 
parison remains a difficult one for an additional fundamental reason: 
the term better has not been defined. Thus we have no universal quality 
criterion upon which to base our judgments. A number of potential 
criteria of this type might be considered (e.g., least-mean-square differ- 
ence between object and image intensities), but unfortunately the final 
interaction of a human observer is so complex and so little understood 
that a truly meaningful criterion is difficult to specify. 

In the absence of a meaningful quality criterion, we can only examine 
certain limited aspects of the two types of images, realizing that the 
comparisons so made will probably bear little direct relation to overall 
image quality. Nonetheless, such comparisons are highly instructive, for 
they point out certain fundamental differences between the two types of 
illumination. 


Frequency spectrum of the image intensity 


One simple attribute of the image intensity which can be compared in the 
two cases is the frequency spectrum. While the incoherent system is linear 
in intensity, the coherent system is highly nonlinear in that variable. 
Thus some care must be used in finding the spectrum in the latter case. 
In the incoherent case, the image intensity is given by the convolution 
equation 
I; = |h)? + I, = |h|? * |U,|? 


On the other hand, for the coherent case we have 
T; => |h * U2 


Let the symbol * represent the autocorrelation integral 


X(fx,fy) * X(fx,fy) & If X(E,n)X*(E — fx, n — fy) dE dn (6-39) 


Then we can directly write the frequency spectra of the image intensity 
in the two cases as 


Incoherent: ¥{I;} = (H * Hj[G, * G,] (6-40) 
Coherent: s{i;} = HG, * HG, 


where G, is the spectrum of U,, and H is the coherent transfer function. 
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The general result (6-40) does not lead one to the conclusion that 
one type of illumination is better than the other in terms of image fre- 
quency content. It does, however, illustrate that the frequency content 
can be quite different in the two cases, and furthermore shows that the 
results of any such comparison will depend very strongly on both the 
intensity and phase distributions of the object. 

To emphasize this latter point, we now consider two objects with the 
same intensity transmittance but different phase distributions, one of 
which can be said to be imaged better in coherent light and the other 
better in incoherent light. Let the intensity transmittance of the object 
be, in both cases, 

T(Lo,Yo) = Cos? 2nfro 


where we shall assume that 
i 
9 <Ghe<ay. 


f. being the coherent cutoff. The amplitude transmittances of the two 
objects are taken to be 


A: ta(Zo,Yo) = cos 2xfro (6-41) 
joe ts(20,Yo) = |cos 2xfz,| 


I 


Thus the two differ only by a periodic phase distribution. 

Figure 6-9 illustrates the various frequency-domain operations that 
lead to the image spectrum for object A. In all cases the imaging system 
is assumed to be diffraction limited. Note that the contrast of the image 
intensity distribution is poorer for the incoherent case than for the coherent 
case. Thus coherent illumination must be said to be better for this partic- 
ular object. 

The corresponding comparison for object B requires less detail. The 
object amplitude is now periodic with fundamental frequency 2f. But 
since 2f > f., no variations of image intensity will be present for the 
coherent case, while the incoherent system forms the same image it did 
for object A. Thus the incoherent illumination must be termed better in 
this second case. 

In summary, then, which particular type of illumination is better 
from the point of view of image spectral content depends very strongly 
on the detailed structure of the object, and in particular on its phase 
distribution. It is not possible to conclude that one type of illumination is 
to be preferred in all cases. The comparison is in general a complex one, 
although simple cases, such as the one illustrated above, do exist. For 
a second simple example, the reader is referred to Prob. 6-3. 
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Figure 6-9 Calculation of the spectrum of image intensity for object A. 


Two-point resolution 


A second possible comparison criterion rests on the ability of the respective 
systems to resolve two closely spaced point sources. The two-point resolu- 
tion criterion has long been used as a quality factor for optical systems, 
particularly in astronomical applications where it has a very real practical 
significance. 

According to the so-called Rayleigh criterion of resolution, two 
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—2.0 —1.0 0 1.0 2.0 
Figure 6-10 Image intensity for two incoherent point sources separated by the 
Rayleigh distance. 


incoherent point sources are ‘‘barely resolved” by a (diffraction-limited) 
system when the center of the Airy disk generated by one source falls on 
the first zero of the Airy disk generated by the second. The minimum 
resolvable separation of the geometrical images is therefore 


Ad; 
6 = 1.22 


(6-42) 


where / is the diameter of the exit pupil [cf. Eq. (4-18)]. Figure 6-10 
illustrates the image intensity distribution for this minimum separation. 
Note that the central dip is about 19 percent of the maximum intensity. 

We can now ask whether the two point-source objects, separated by 
the Rayleigh distance 6, would be easier or harder to resolve with coherent 
illumination than with incoherent illumination. This question is, of course, 
academic for astronomical objects, which are always incoherent. It is 
however, of some interest in microscopy, where coherent or incoherent 
illumination could be chosen at will. 

As in the previous examples, the answer to this question is found to 
depend on the phase distribution associated with the object. A cross 
section of the image intensity can be directly written, in normalized 
image coordinates, as 
joule Ji[r(x — 0.61)] es o Jale(x + 0.61)] |? 

(a — 0.61) a(t 0.01) 


where ¢ is the relative phase between the two point sources. Figure 6-11 
shows the distributions of image intensity for point sources in phase 
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(@ = 0°), in quadrature (@ = 90°), and in phase opposition (@ = 180°). 
When the sources are in quadrature, the image intensity distribution is 
identical to that resulting from incoherent point sources. When the objects 
are in phase, the dip in the image intensity is absent and therefore the 
two points are not as well resolved as for incoherent illumination. Finally, 
when the two objects are in phase opposition, the dip is greater than 19 
percent, and the two points are resolved better with coherent illumination 
than with incoherent illumination. Thus there can again be no generaliza- 
tion as to which type of illumination is preferred for two-point resolution. 


Other effects 


There are certain other miscellaneous properties of images formed with 
coherent light that should be mentioned in any comparison with inco- 
herent images [Ref. 6-13]. First, the responses of incoherent and coherent 
systems to sharp edges are notably different. Figure 6-12 shows the 
theoretical responses of a system with a square exit pupil to a step function 
object. Figure 6-13 shows actual photographs of the image of an edge in 
the two cases. The coherent system is seen to exhibit rather pronounced 
‘“ringing.’”’ This property is analogous to the ringing that occurs in video 
amplifier circuits with transfer functions that fall too abruptly. The 


I(x) 
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Figure 6-11 Image intensity for two mutually coherent point sources 
separated by the Rayleigh distance. 
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Figure 6-12 Theoretical intensity responses of coherent and incoherent systems 
to a step-function object. 


coherent imaging system has, of course, a transfer function with sharp 
discontinuities, while the falloff of the OTF is more gradual. From the 
plots of Fig. 6-12 it is also seen that the apparent positions of the edge 
(i.e., the positions of half-intensity response) differ for the two types of 
illumination, with slight errors introduced in the coherent case. 

In addition, we must mention the so-called speckle effect that is 
readily observed with highly coherent illumination. Figure 6-14 shows 
photographs of an object taken in coherent and incoherent light. The 
granular nature of the coherent image is a direct consequence of the 
optical roughness of the surface of the object and the high coherence of the 
laser source that was used for illumination (see, for example, Refs. 6-14 
and 6-15). The size of the individual speckles can be shown [Ref. 6-16] to 
be roughly the size of a single resolution cell on the object. Thus when the 
particular object of interest is near the resolution limit of the optical 
system, the speckle effect can be quite bothersome. Much of this problem 
can be eliminated by placing a piece of ground glass in the illuminating 
beam and moving it during the observation. The high coherence of the 
source is then partially destroyed and the speckles will to a large extent 
“wash out.” 

Finally, highly coherent illumination is particularly sensitive to 
optical imperfections that may exist along the path to the observer. For 
example, tiny dust particles on a lens may lead to very pronounced 
diffraction patterns that will be superimposed on the image. Again these 
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effects can be minimized by placing a moving scatterer, such as ground 
glass, in the illuminating beam. 


6-6 RESOLUTION BEYOND THE CLASSICAL DIFFRACTION LIMIT 


Until rather recently it had been widely accepted that diffraction effects 
represent the fundamental limits to optical system performance. To 
resolve beyond the classical diffraction limit was believed to be a hopeless 
task. Even an infinitely large lens would be limited by the evanescent- 
wave phenomenon to resolutions of the order of an optical wavelength. 
More recent work [Refs. 6-17 to 6-19] has shown, however, that for certain 
types of objects, resolution beyond the classical limit may indeed be 
feasible.! In fact, as we shall show in this section, for the class of spatially 
bounded objects, it is in principle possible to resolve with infinite precision. 


Fundamental mathematical reasons 


There exist very fundamental mathematical reasons why resolution 
beyond the classical limit should be possible. These reasons rest on two 
basic mathematical principles which we list here as theorems. For proof 
of these principles see, for example, Ref. 6-20. 


Theorem 1. The two-dimensional Fourier transform ofa spatially bounded 
function is an analytic function in the fxfy plane. 

Theorem 2. If any analytic function in the fxfy plane is known exactly 
in an arbitrarily small (but finite) region of that plane, then the 


1 Of particular interest is the work of G. Toraldo di Francia [Ref. 6-17], who used 
the theory of supergain antennas to demonstrate that the customary ‘‘Rayleigh limit” 
to resolution is of a practical, rather than a theoretical, nature. 


(b) 


Figure 6-13 Photographs of a step-function object for (a) coherent and (b) incoherent 
illumination. [By permission of P. S. Considine.] 


134 / INTRODUCTION TO FOURIER OPTICS 


fogeet> 


s eee Saree tee 


Figure 6-14 (a) Coherent and (b) incoherent images illustrating the speckle effect. 


(The object ts a photographic transparency illuminated through a 
diffuser.) 


entire function can be found (uniquely) by means of analytic 
continuation. 


Now for any imaging system, whether coherent or incoherent, the 
image information arises from only a finite portion of the object spectrum, * 
namely, that portion passed by the transfer function of the system. If this 
finite portion of the object spectrum can be determined exactly from the 
image, then, for a bounded object, the entire object spectrum can be found 
by analytic continuation. If the entire object spectrum can be found, 
then the exact object present can be reconstructed with infinite precision. 


A particular restoration scheme 


While the fundamental mathematical principles are most easily stated 
in terms of analytic continuation, there are a variety of restoration 
procedures that might be envisioned to attain the same result. A par- 
ticularly simple one has been discussed by J. L. Harris [Ref. 6-18]. 
Suppose that the object is bounded by a rectangle of sides Lx and Ly. 


1 By object spectrum we mean, of course, the spectrum of the object amplitude 
in the coherent case and the spectrum of the object intensity in the incoherent case. 
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Then by the Whittaker-Shannon theorem, the object spectrum G,(fx,fy) 
can be written! in terms of its sample values at (n/Lx, m/Ly): 


i] oO 


Gifafy)=- ) YG eS =) 


n=—%0o m=— 


sine [is @ — =)\ sine E (J _ a) (6-43) 


Now due to the limited passband of the optical system, values of 
G.(n/Lx, m/Ly) can be found for only a few low-integer values of (n,m). 
We would like, of course, to extend our knowledge to larger integer 
values, say (n = +N, m = +M), so that the approximation 


Go(fx,fy) = ) ee ie =) 


n=—N m=-M 


sinc E (Js - ~)| sine E (J _ *)| (6-44) 


would yield accurate values of G,(fx,fy) in at least some finite region 
outside the passband of the system. In other words, (6-44) will help us 
specify the object spectrum in at least some finite region outside the 
passband if enough of the sample values outside the passband can be 
determined. 

To determine the sample values outside the passband, Eq. (6-44) can 
be used in a second role: we measure the values of G, at any (2N + 1) 
(2M + 1) frequencies within the passband. If any one of these measured 
values is substituted for G, on the left of (6-44), then we have one equa- 
tion in the (2N + 1)(2M + 1) unknowns:? 


G.(2 7 ):n=0 =e os ee No Ar 1, di gE 


Thus the (2N + 1)(2M + 1) known values of G, generate (2N + 1) 
(2M + 1) linear equations in the (2N + 1)(2M + 1) unknowns. This 
set of equations can in principle be solved to yield an array of sample 
values that extends well outside the passband. The sampling theorem 
then allows us to approximate G, between the samples, and we have 


1 The analyticity of G, can be proved directly from (6-43) by expanding the 
sinc functions in a power series, thus demonstrating that G, also has a power-serles 
expansion. 

2 Actually, some of these unknowns may lie within the passband and therefore be 
known, but this simply makes the restoration easier. 
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succeeded in obtaining at least some of the object spectrum that ‘lies 
outside the passband. A Fourier transformation then yields an image 
with resolution better than the classical limit. 


Practical limitations 


The mathematical principles underlying the theory of object restoration 
are now well understood. It remains to be seen, however, what success 
can be achieved in the application of these principles to real optical 
systems. The fundamental difficulty rests on the noise that invariably 
accompanies any measurement of the object spectrum within the pass- 
band. Imprecision of the knowledge within the passband can be greatly 
amplified when the system of linear equations is solved for sample values 
outside the passband, for such sample values are expressed as linear 
combinations of all the measured values, each of which contributes some 
noise. The problem of object restoration remains a subject of active 
research, and only time will tell the degree to which these techniques 
can be successfully exploited in practice. 


PROBLEMS 


6-1 An incoherent imaging system has a square pupil function of width J. A square 
stop, of width //2, is placed at the center of the exit pupil, as shown in Fig. 
P6-1. 

(a) Sketch a cross section of the optical transfer function with and without 
the stop present. 

(b) Sketch the limiting form of the optical transfer function as the size of the 
stop approaches the size of the exit pupil. 


re | 
ae 


Figure P6-1 


6-2 An incoherent imaging system has a circular exit pupil of diameter J. A half- 
plane stop is inserted in the exit pupil, yielding the modified pupil shown in 
Fig. P6-2. Find expressions for the optical transfer function along the fx and 
fy axes. 


6-3 


6-4 


6-5 
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Figure P6-2 


An object with a square-wave amplitude transmittance (shown in Fig, P6-3) 
is imaged by a lens with a circular pupil function. The focal length of the 
lens is 10 em, the fundamental frequency of the square wave is 1,000 cycles/cm, 
the object distance is 20 cm, and the wavelength is 10-4 cm. What is the mini- 
mum lens diameter that will yield any variations of intensity across the image 
plane for the cases of 

(a) coherent object ilumination? 

(b) incoherent object illumination? 


i(x) 


| U(x, y) = t(x)-1 


‘a 


A sinusoidal test object with amplitude transmittance 


Figure P6-8 


(z,y) = 14(1 + cos 2nfz) 


is applied to a coherent imaging system. Assuming that the frequency f is low 
enough to be passed by the system, and neglecting the magnification and over- 
all attenuation of the system: 

(a) What is the intensity distribution across the image plane for an aberration- 
free system? 

(b) Show that this same intensity distribution appears across an infinity of 
misfocused image planes. 


A sinusoidal amplitude grating with amplitude transmission 
Ux,y) = 14 + 4 cos 2nfz 


is placed in front of a converging lens (circular of diameter J, focal length f) 
and obliquely illuminated by a monochromatic plane wave traveling at angle 6 
in the x,z plane, as shown in Fig. P6-5: 
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6-6 


6-7 


(a) What is the Fourier spectrum of the light-amplitude distribution trans- 


mitted by the object screen? 

(b) Assuming d, = d; = 2f, what is the maximum angle 6 for which any varia- 
tions of intensity will appear in the image plane? 

(c) Assuming that this maximum angle @ is used, what is the intensity distribu- 
tion in the image plane, and how does it compare with the corresponding 
intensity distribution for 6 = 0? 

(d) Assuming that the maximum angle @ is used, what is the maximum grating 
frequency f that will yield variations of intensity in the image plane? How 
does this frequency compare with the cutoff when 6 = 0? 


i k 2 
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Figure P6-5 


With reference to Eq. (6-38), show that the geometrical-optics prediction of 
the OTF of a misfocused system with a square pupil is 


H(fx,fy) = sine E (4) aoe E (=) 


where w is the maximum path-length error along the x or y axes and f, is the 
coherent cutoff frequency. 


An object has an intensity transmittance given by 
r(x,y) = }9(1 + cos 2afz) 


and introduces a constant, uniform phase delay across the object plane. This 
object is imaged by a positive lens of diameter J and focal length f, as shown 
in Fig. P6-7._ Compare the maximum frequencies f transmitted by the system 
for the cases of coherent and incoherent illumination. 


Figure P6-7 
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6-8 Sketch the fx and fy cross sections of the optical transfer function of an inco- 
herent imaging system having as a pupil function the aperture shown in Fig. 
P6-8. (Be sure to label the various cutoff frequencies on these sketches.) 


\ 
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Figure P6-8 
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6-9 A certain object has the property that all the frequency components of its 
amplitude transmittance lie on circles in the frequency plane, with the radii 
of the circles being given by 


Pm = V2ma i nO PRRs Yo 


Assuming uniform plane-wave illumination, and neglecting diffraction effects 
introduced by the finite size of the object, show that such an object is “self- 
imaging”’ in the sense that perfect images form at periodic distances behind 
the object (no lenses required). 


6-10 A quantity of considerable utility in determining the seriousness of the aberra- 
tions of an incoherent system is the Strehl definition D, which is defined as the 
ratio of the light intensity at the maximum of the point-spread function (impulse 
response) to the maximum of the point-spread function of the same instrument 
in the absence of aberrations. Prove that D is equal to the normalized volume 
under the optical transfer function of the aberrated instrument; that is, 


ff 5 fa,fy) 


—o 


ff KH fx, fy) 


—— 0 


aberrations 


D 


without dfx dfy 


aberrations 
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7 SPATIAL FILTERING AND OPTICAL 
INFORMATION PROCESSING 


The broad utility of linear systems concepts in the analysis of imaging 
systems is evident from the preceding chapter. However, if these concepts 
were useful only for analysis purposes, they would occupy a far less 
important position in modern optics than they in fact enjoy today. 
Their true importance comes into full perspective only when the exciting 
possibilities of system synthes?s are considered. 

There exist many examples of the benefits reaped by the application 
of linear systems concepts to the synthesis of optical systems. One class 
of such benefits has arisen from the application of frequency-domain 
reasoning to the improvement of various types of imaging instruments. 
Examples of this type of problem are discussed in their historical per- 
spective in Sec. 7-1. 

There are other equally important applications that do not fall in 
the realm of imaging as such, but rather are more properly considered 
in the general domain of information processing. Such applications rest 
on the ability of optical systems to perform general linear transformations 
of input data. In some cases a vast amount of data may, by its sheer 
quantity, overpower the effectiveness of a human observer. A linear 
transformation can then play a crucial role in the reduction of large 
quantities of data, yielding indications of the particular portions of the 
data that warrant the attention of the observer. An example of this type 
of application is found in the discussion of character recognition (Sec. 7-6). 
In other cases a body of data may simply not be in a form compatible 
with a human observer, and a linear transformation of the data may place 
it in compatible form. An example of this type of application is found in 
the discussion of data processing for synthetic aperture antennas 
(Sec. 7-7). 

The entire subject of optical information processing is too broad to 
be fully treated in any single chapter; in fact, more than one book devoted 
exclusively to the subject already exists [e.g., Refs. 7-1, 7-2]. We shall 
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Object Focal plane Image 


Figure 7-1 The Abbe-Porter experiment. 


attempt here only to indicate a few broad principles, choosing examples 
that illustrate those principles. In so doing, a number of interesting and 
useful techniques will unavoidably be omitted, but the interested reader 
may pursue the references to obtain a more complete picture of the field. 


7-1_ HISTORICAL BACKGROUND 


The history of Fourier synthesis techniques can be said to have begun 
with the first intentional manipulations of the spectrum of an image. 
Experiments of this type were first reported by Abbe in 1893 [Ref. 6-7] 
and later by Porter in 1906 [Ref. 7-3]. In both cases the express purposes 
of the experiments were verification of Abbe’s theory of microscope 
imagery and investigation of its implications. Because of the beauty and 
simplicity of these experiments, we discuss them here in some detail. 


The Abbe-Porter experiments 


The experiments performed by Abbe and Porter provide a powerful 
demonstration of the detailed mechanism by which coherent images are 
formed, and indeed of the most basic principles of Fourier analysis itself. 
The general nature of these experiments is illustrated in Fig. 7-1. An 
object consisting of a fine wire mesh is illuminated by a coherent source. 
In the back focal plane of the imaging lens appears the Fourier spectrum 
of the periodic mesh, and finally in the image plane the various Fourier 
components passed by the lens are recombined to form a replica of the 
mesh. By placing various obstructions (e.g., an iris, a slit, or a small stop) 
in the focal plane, it is possible to directly manipulate the spectrum of 
the image in a variety of ways. 

Figure 7-2a shows a photograph of the unmodified spectrum of the 
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mesh; Fig. 7-2b is the corresponding image. The periodic nature of the 
object generates in the focal plane a series of isolated spectral components, 
each spread somewhat by the finite extent of the circular aperture within 
which the mesh is confined. Bright spots along the horizontal axis in the 
focal plane correspond to complex-exponential components of the object 
that are directed horizontally (cf. Fig. 2-1); bright spots along the vertical 
axis correspond to vertically directed complex-exponential components. 
Off-axis spots correspond to components directed at corresponding angles 
in the object plane. 

The power of spatial filtering techniques is well illustrated by insert- 
ing a narrow slit in the focal plane to pass only a single row of spectral 
components. Figure 7-3a shows the transmitted spectrum when a hori- 
zontal slit is used. The corresponding image, seen in Fig. 7-3b, contains 
only the vertical structure of the mesh; it is precisely the horizontally 
directed complex-exponential components that contribute to vertical 
image structure. The suppression of horizontal structure is quite complete. 

When the slit is rotated 90° to pass the vertical spectral row of 
Fig. 7-4a, the image is seen to contain only horizontal structure. Other 
interesting effects can also be readily observed. For example, if an iris is 
placed in the focal plane and stopped down to pass only the on-axis 
Fourier component, then with a gradual expansion of the iris the Fourier 
synthesis of the mesh can be watched step by step. In addition, if the 


(a) (b) 


Figure 7-2 Photograph of the unmodified mesh and its spectrum. (a) Spectrum; 
(b) wmage. 
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Figure 7-3 Mesh filtered with a horizontal slit. (a) Spectrum; (b) image. 


iris is removed and a small stop is placed on the lens axis in the focal 
plane to block only the central-order or “zero frequency’? component, 
then a contrast reversal can be seen in the image of the mesh (see 
Prob. 7-1). 


The Zernike phase-contrast microscope 


Many objects of interest in microscopy are largely transparent, thus 
absorbing little or no light (e.g., an unstained bacterium). When light 


(a) (b) 


Figure 7-4 Mesh filtered with a vertical slit. (a) Spectrum; 
(b) image. 
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passes through such an object, the predominant effect is the generation 
of a spatially varying phase shift; this effect is not, of course, directly 
observable with a conventional microscope and a sensor that responds to 
light intensity. A number of techniques for viewing such objects have 
been known for many years; these include interferometric techniques, the 
central dark ground method in which a small stop is used to block only the 
central-order spectral component (see Prob. 7-2), and the Schlieren 
method in which all spectral components to one side of the central order 
are excluded (see Prob. 7-3). All these techniques suffer from the fact 
that the observed intensity variations are not linearly related to the phase 
shift and therefore cannot be taken as directly indicative of the thickness 
variations of the object. 

In 1935, Fritz Zernike proposed [Ref. 7-4] a new phase contrast tech- 
nique which rests on spatial-filtering principles and has the advantage 
that the observed intensity zs (under a certain condition to be discussed) 
linearly related to the phase shift introduced by the object. This develop- 
ment represents an early success of synthesis ideas and therefore will be 
treated in some additional detail. 

Suppose that a transparent object with amplitude transmittance 


is coherently illuminated in an image-forming system. For mathematical 
simplicity we assume a magnification of unity and neglect the finite 
extent of the exit and entrance pupils of the system. In addition, a 
necessary condition to achieve linearity between phase shift and intensity 
is that the phase shift ¢ be less than 1 radian, in which case the amplitude 
transmittance can be approximated by 


t(x,y) = 1 + 76(z,y) (7-2) 


We are, of course, neglecting terms of order ¢? and higher in this approxi- 
mation. Note that the first term of (7-2) leads to a strong wave component 
that passes through the sample without change, while the second term 
generates weaker diffracted light that is deflected away from the axis of 
the system. 

The image produced by a conventional microscope can be written 


I; |1+ je? =1 


where, to remain consistent with our approximation, the term ¢? has been 
replaced by zero. Zernike realized that the diffracted light is not okserv- 
able because it is in phase quadrature with the strong background, and 
that if this phase-quadrature relation could be modified, the two terms 
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might interfere more directly to produce observable variations of image 
intensity. Recognizing that the background is brought to a focus on-axis 
in the focal plane while the diffracted light—containing higher spatial 
frequencies—is spread away from the focal point, he proposed that a 
phase-changing plate be inserted in the focal plane to modify the phase 
relation between focused and diffracted light. 

The phase-changing plate can consist of a glass substrate on which a 
small transparent dielectric dot has been coated. The dot is placed at 
the center of the focal plane and has a thickness and index of refraction 
such that it retards the phase of the focused light by either 7/2 radians or 
3/2 radians relative to the phase retardation of the diffracted light. In 
the former case the intensity in the image plane becomes 


I; = exp [j@r/2)] + jo? = [70 + 4)[? S14 29 (7-3) 
while in the latter case we have 
I; = |exp [j(8/2)] + jel? = |-j — 4)? S1 — 2¢ (7-4) 


Thus the image intensity has become linearly related to the phase shift 
¢@. When the phase of the background is retarded by 7/2, the result (7-3) 
is known as positive phase contrast, while a 37/2 retardation is said to 
yield negative phase contrast [Eq. (7-4)]. It is also possible to improve the 
image contrast by making the phase-shifting dot partially absorbing 
(see Prob. 7-4). 

The phase-contrast method is one technique for converting a spatial 
phase modulation to a spatial intensity modulation. The reader with a 
background in electronics may be interested to note that one year after 
Zernike’s proposal a remarkably similar technique was proposed by E. H. 
Armstrong [Ref. 7-5] for converting amplitude-modulated electronic 
signals to phase-modulated signals. As we have seen in Chap. 6 and will 
continue to see here, the disciplines of electrical engineering and optics 
were to develop closer ties in the years to follow. 


Improvement of photographs: Maréchal 


In the early 1950s, workers at the Institut d’Optique, Université de 
Paris, became actively engaged in the use of coherent spatial-filtering 
techniques to improve the quality of photographs. Most notable among 
these workers was A. Maréchal, whose success with these techniques was 
to provide a strong motivation for future expansion of interest in the 
optical information-processing field. 

Maréchal regarded undesired defects in photographs as arising from 
corresponding defects in the optical transfer function of the imaging 
system that produced them. He further reasoned that if the photographic 
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transparencies were placed in the object plane of a second (coherent) 
system, then, by insertion of appropriate attenuating and phase-shifting 
plates in the focal plane, a compensating filter could be synthesized to at 
least partially remove the undesired defects. While the transfer function 
of the initial imaging system might be poor, the product of that transfer 
function with the transfer function of the compensating system would 
hopefully yield a frequency response that was more satisfactory. 

A variety of different types of improvements to photographs were 
successfully demonstrated by Maréchal and his coworkers. For example, 
it was shown that small details in the image could be strongly emphasized 
if the low-frequency components of the object spectrum were simply 
attenuated. Considerable success was also demonstrated in the removal 
of image blur. In the latter case, the original imaging system was badly 
defocused, producing an impulse response which (in the geometrical- 
optics approximation) consisted of a uniform circle of light. The corre- 
sponding transfer function was therefore of the form 

5e(p) & 2 Ji (ap) 

Tap 

where a is a constant and p = ~/fx? + fy?. The compensating filter was 
synthesized by placing an absorbing plate and a dephasing plate in the 
focal plane of the filtering system. The absorbing plate attenuated the 
large low-frequency peak of 3%, while the dephasing plate shifted the 
phase of the first negative lobe of # by 180°. The original and compensated 
transfer functions are illustrated in Fig. 7-5. 

As an additional example, it was shown that the periodic structure 
associated with the halftone process of printing photographs (as seen, 
for example, in newspapers) could be suppressed by a simple spatial 
filter. The halftone process is, in many respects, analogous to the periodic 
sampling procedures discussed in Sec. 2-3. The spectrum of a picture 
printed in this fashion has a periodic structure much like that illustrated 
in Fig. 2-5. By inserting an iris in the focal plane of the filtering system, 
it is possible to pass only the harmonic zone centered at zero frequency, 
thereby moving the periodic structure of the picture while passing all 
of the desired image structure. 


The emergence of a communication theory viewpoint! 
In the early 1950s it became evident that an exchange between the 


disciplines of communication theory and optics could reap high profits. 


1 We make no attempt here to mention all the early contributions to the wedding 
of the two viewpoints, but rather we concentrate on those references which will be 
of most interest to the reader with an electrical engineering background. 
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Figure 7-5 Compensation for image blur. (a) Focal-plane filter; (b) transfer 
functions. 


Many of the problems facing those working in optics bore strong resem- 
blance to the optimum filtering, detection, and estimation problems of 
communication theory. Much initial stimulation toward an exchange was 
provided by a communication theorist, Peter Elias, and his associates 
D.S. Grey and D. Z. Robinson, with the publication of a paper in 1952 
entitled ‘‘Fourier Treatment of Optical Processes” [Ref. 7-6], and again 
by Elias with the publication of the paper ‘Optics and Communication 
Theory” [Ref. 7-7] in 1953. The real wedding of the two viewpoints was 
provided by a physicist, E. L. O’ Neill, with the publication of his paper 
“Spatial Filtering in Optics” in 1956 [Ref. 7-8] and, more generally, 
through the great impact on the optics community of his research and 
teaching. That the marriage of these disciplines had been and was to 
continue to be a fruitful one was testified by the interest in and the quality 
of the work presented at a symposium entitled ‘‘Communication and 
Information Theory Aspects of Modern Optics” held in 1960 [Ref. 7-9]. 
Since that time the merger of the two points of view has become so com- 
plete that it is sometimes difficult to judge whether a particular piece 


of work should be published in an optics journal or an electrical engineer- 
ing journal. 
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Application of coherent optics to more general data processing 


While the early 1950s were characterized by a growing realization on 
the part of physicists that certain aspects of electrical engineering were 
of particular relevance to optics, the late fifties and early sixties saw a 
gradual realization on the part of electrical engineers that spatial-filtering 
systems might be usefully employed in their more general data-processing 
problems. The potentials of coherent filtering were particularly evident 
in the field of radar signal processing and were exploited at an early stage 
by L. J. Cutrona and his associates at the University of Michigan Radar 
Laboratory. The publication of the paper “Optical Data Processing and 
Filtering Systems” [Ref. 7-10] by the Michigan group in 1960 stimulated 
much interest in these techniques among electrical engineers and physicists 
alike. One of the most successful applications of coherent filtering in the 
radar realm has been to the processing of the data collected by “synthetic 
array’ antennas [Ref. 7-11], a subject that will be briefly treated in 
Sec. 7-7. A survey of literature from the mid-1960s shows successful 
application of coherent processing techniques in such widely diverse 
fields as, for example, Fourier spectroscopy [Ref. 7-12] and seismic-wave 
analysis [Ref. 7-13]. 


7-2. PHOTOGRAPHIC FILM 


Photographic film has been and continues to be a basic component of 
optical processing and filtering systems. It can play three very funda- 
mental roles in the filtering process. First, it can serve as the medium on 
which input data are fed into the optical system for processing. Second, it 
can serve as the medium on which the filter information is impressed; for 
example, it may provide the attenuations required in the frequency plane 
for synthesis of a desired transfer function. Third, it may provide the 
means of recording the output of the processing system. Often it plays 
all three roles. 

There exist a number of other media which have been used on a 
limited basis to replace film in one or more of its roles. These media 
include liquid sonic cells, deformable oils, thermoplastic tape, and photo- 
chromic films and glasses. There is every evidence that these media will 
grow in importance, but it is also highly doubtful that film will ever be 
totally replaced as a primary element of optical information-processing 
systems. 

For these reasons, we now briefly detour to discuss some fundamental 
properties of photographic film. For more comprehensive treatments, the 
reader may consult Refs. 7-14 and 7-15. 
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The physical process of exposure 


An unexposed photographic film or plate generally consists of a multitude 
of tiny silver halide grains suspended in a gelatin support, which is in 
turn attached to a firm “base” consisting typically of acetate for films 
and glass for plates. When the photosensitive material is exposed to light, 
the silver halide grains absorb optical energy and undergo a complex 
physical change. Those grains that have absorbed a sufficient amount 
of energy are found to contain tiny patches of metallic silver; the patches 
are referred to as development centers. The exposed film is then subjected 
to the chemical process called development, during which the existence of 
a single tiny development center can precipitate the change of the entire 
silver halide grain to metallic silver. The grains that do not contain 
development centers do not undergo such a change. Finally, the film is 
“‘fixed’’ by subjecting it to chemical processing that removes the remaining 
silver halide grains while leaving the metallic silver, thus preventing the 
later deterioration of clear grains into metallic silver. The silver grains 
are, of course, largely opaque at optical frequencies; the opacity of the 
developed film or plate will thus depend on the density of silver grains 
in each region of the transparency. 


Definition of terms 


The exposure E to which a film or plate is subjected is defined as energy 
per unit area incident at each point on the photosensitive surface. If the 
intensity incident is represented by! J and the duration of the exposure is 
T seconds, we have 


E =9gT 
The intensity transmittance r(z,y) of the developed transparency is 
defined by 


I [transmitted at (z,y)] 
I [incident at (zx,y)] 


t(v,y) = local average 


where the local average is over an area large compared with the size of a 
film grain, but small compared with an area within which the transmitted 
intensity changes significantly. 

In the year 1890, F. Hurter and V. C. Driffield published a classic 
paper in which they demonstrated that log (1/r) should be proportional 
to the silver mass per unit area of a developed transparency. They 


1 We shall consistently use g to refer to intensities of interest before or during 
exposure, and J for intensities of interest after development. 
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accordingly defined the photographic density D as 


D = log (*) (7-5) 


A plot of photographic density vs. the logarithm of exposure is the 
most widely used description of the photosensitive properties of photo- 
graphic film, and is commonly referred to as the Hurter-Driffield curve 
(or for brevity, the H&D curve). Figure 7-6 illustrates a typical H&D 
curve for a photographic negative. When the exposure is below a certain 
level, the density is independent of exposure and equal to a minimum 
value referred to as gross fog. In the “toe” of the curve, density begins to 
increase with increasing exposure. There follows a region of the curve in 
which density is linearly proportional to logarithmic exposure; the slope 
of this linear region of the curve is referred to as the film gamma (y). 
Finally, the curve saturates in a region called a shoulder, and again there 
is no change in density with increasing exposure. 

The linear region of the H&D curve is the portion generally used in 
conventional photography. A film with a large value of y is called a 
high-contrast film, while a film with a low y is called a low-contrast film. 
Figure 7-7 illustrates the H&D curves of low- and high-contrast films. 
The particular value of y realized is influenced by three major factors: 
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contrast 


log E 
Figure 7-7 H&D curves of high- and low- 
contrast films. 


(1) the type of emulsion! in question (for example, Plus-X and Tri-X are 
low-contrast films, with gammas of 1 or less, while High Contrast Copy 
has a gamma of 2 or 3); (2) the particular developer used; and (3) the 
development time. Figure 7-8 illustrates a typical dependence of y on 
development time. With a judicious choice of film, developer, and develop- 


1 The term emulsion refers to the light-sensitive portion of the film, thus excluding 
the base. 
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Figure 7-8 The dependence of film gamma on 
development time. 
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ment time, it is possible to achieve a prescribed value of y with a fair 
degree of accuracy. 


Film in an incoherent optical system 


In many of its uses, film may be regarded as an element that maps an 
intensity distribution J incident during exposure into an intensity distri- 
bution J transmitted after development. Such a point of view is partic- 
ularly appropriate when film is used as an element of an incoherent 
optical system. We consider now the detailed nature of the mapping so 
defined. 

Assuming that the film is used in the linear region of the H&D curve, 
the density D may be written 


D = y, log E — Do = yn log (ST) — D. (7-6) 


where jy, is the slope of the linear region of the curve and — D, is the value 
of D where the straight-line approximation would meet the D axis if 
there were no toe. The subscript m on y is used to indicate that we are 
dealing with a negative transparency. 

The intensity incident during exposure can be related to the intensity 
transmittance after development by recalling the definition of photo- 


graphic density, 
1 
D = log (+) 
Tn 


When this definition is substituted in (7-6), we find 


log tT = —vyn log (§T) + D. 
or equivalently, t™ = 102-($T)-% 
Finally, Tm = KyI7~%™ (7-7) 


where K,, is a positive constant. Note that the intensity mapping defined 
by this relation is a highly nonlinear one for any positive value of film 
gamma. 

It is also possible to achieve a positive power-law relation between 
intensity transmittance and intensity incident during exposure, although 
to do so generally requires a two-step photographic process. During the 
first step, a negative transparency is made in the usual fashion. During 
the second step, the light transmitted by the negative transparency is 
used to expose a second film, and the result is a final positive transparency. 
To understand this process more quantitatively, let the transmittance of 
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the first transparency be written, from (7-7), 
oh Kysg7™ 


If this transparency is placed in contact with a second piece of unexposed 
film and illuminated with intensity J,, then the intensity applied to the 
second film is simply 7n1J,, and the resulting intensity transmittance 
becomes 


Tp a Kn2(Lotn1)—™ — Kaal, UK gy Stes 
or equivalently, tp = K,I% (7-8) 


where K, is a positive constant and yp = Yn1Yn2 is the overall gamma of 
the two-step process. Evidently a positive transparency does provide a 
linear mapping of intensity when (and only when) the overall gamma is 
unity. 

While a linear mapping of intensity incident during exposure into 
intensity transmitted after development has been seen to occur only under 
very special conditions, nonetheless film can be shown to provide a linear 
mapping of incremental intensity variations under a much wider class of 
conditions. For further development of this point, the reader is referred 
to Prob. 7-5. 


Film in a coherent optical system 


When film is used as an element of a coherent optical system, it is more 
appropriately regarded as providing either (1) a mapping of intensity 
incident during exposure into complex amplitude transmitted after 
development, or (2) a mapping of complex amplitude incident during 
exposure into complex amplitude transmitted after development. The 
second viewpoint can be used, of course, only when the light that exposes 
the transparency is itself coherent. 

Since the complex amplitude of the transmitted light is, from both 
viewpoints, the important quantity in a coherent system, it is necessary 
to describe a transparency in terms of its complex amplitude transmittance 
t. It is most tempting to define t simply as the positive square root of 
the intensity transmittance 7. However, such a definition neglects the 
relative phase shifts that can occur as the light passes through the film. 
Such phase shifts arise as a consequence of variations of film thickness, 
which can originate in two distinct ways. First, there are generally 
random variations of thickness across the base of the film, i.e., the base 
is not optically flat. Second, the thickness of the emulsion is found to 
vary in accordance with the density of silver in the developed trans- 
parency. This latter variation is deterministic and strongly dependent 


SPATIAL FILTERING AND OPTICAL INFORMATION PROCESSING / 155 


Index-mateching oil 


N 


Sy 


Ss 


ies 
2 
5 


ibaa 
LLL hod 


LL 


CLL Ela, 


Optical flat 


ha 
LLM 


LIS EI Ldda 
Li 


LD 


ib 
P72 


Ci 


Figure 7-9 A liquid gate for removing film 
thickness variations. 


upon the exposure variations to which the film is subjected. Thus a 
complete description of the amplitude transmittance of the film must be 
written 


t(x,y) = Vr(2,y) exp [id(z,y)] (7-9) 


where ¢(x,y) describes the pattern of phase shifts introduced by the 
transparency. 

In most applications the thickness variations are entirely undesired, 
for they cannot easily be controlled. It is possible to remove the effects of 
these variations by means of a device called a liquid gate. Such a device 
consists of two optical flats, between which the transparency and an 
index-matching oil can be sandwiched, as illustrated in Fig. 7-9. The index 
of refraction of the oil must be chosen by compromise, for it is impossible 
to match simultaneously the different indices of the base, the emulsion 
and the glass (which is usually flat on only one side). However, with a 
proper choice of oil, the optical path length through the liquid gate can 
be made nearly constant, allowing the amplitude transmittance of the 
film and gate to be written 


t(x,y) = Vr(2,y) (7-10) 


When the phase shifts have been removed, a combination of Kas. 
(7-10) with (7-7) and (7-8) allows the amplitude transmittance to be 
expressed as 


kn—%!2 = k,|U|-% 
kyd»!? ae k,|U|% 


Negative transparency: ike 


uy (7-11) 
Positive transparency: t; 
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Figure 7-10 Typical transmittance-exposure curve 
for a negative transparency. 


where U is the complex amplitude of the field incident during exposure, 
and kaga i oe Roem 

As will be seen in many of the examples to be discussed in later sec- 
tions, it is often desirable to have film act as a square-law mapping of 
complex amplitude. Such behavior can be achieved in a number of ways, 
one of which is to make a positive transparency with an overall gamma 
(yp) of 2, as can be seen from Eq. (7-11). In order to obtain a maximum 
dynamic range of exposure over which this relation holds, the first gamma 
of the two-step process is often chosen less than unity (for example, 
Yn1 = 14), and the second chosen greater than 2 (for example, yn2 = 4) 
such that their product remains equal to 2. 

It is possible, however, to obtain square-law action over a limited 
dynamic range with a transparency of any gamma, be it a positive or a 
negative. This point is most easily seen by abandoning the traditional 
H&D curve description of film and making instead a direct plot of ampli- 
tude transmittance vs. exposure (on a linear scale). Such a description 
was advocated by Maréchal [Ref. 7-9, p. 48] and was very successfully 
used by Kozma [Ref. 7-16] in an analysis of the effects of photographic 
nonlinearities. Figure 7-10 shows a plot of amplitude transmittance vs. 
exposure (the {-H curve) for a typical negative transparency. If the film 
is “biased” to an operating point that lies within the region of maximum 
linearity of the t-# curve, then over a certain dynamic range the film will 
provide a square-law mapping of incremental changes of amplitude. 
Thus if #, represents the bias exposure and t the corresponding bias 
transmittance, we may represent the t+# curve within its region of 
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linearity by 
t,t + B(E — Es) =t+ B’|AU|? (7-12) 


where 6 is the slope of the curve at the bias point, AU represents the 
incremental amplitude changes, and §’ is the product of 6 and the exposure 
time. Note that 8 and #’ are negative numbers for a negative transparency. 

Figure 7-11 compares the ¢t-# curves for high-gamma and low-gamma 
films. Note that the high-gamma curve has a steeper slope and therefore 
is more efficient in transferring small changes of exposure into changes of 
transmittance. However, this increased efficiency is accompanied by a 
smaller dynamic range of exposure over which the t-H curve remains 
linear. As a final point of interest, the bias at which maximum dynamic 
range is obtained is generally found to fall near the toe of the correspond- 
ing H&D curve. 

The reader interested in additional discussions of the behavior of 
photographic film in coherent systems should consult Refs. 7-17 and 7-18. 


The modulation transfer function 


To this point, we have tacitly assumed that any variations of exposure, 
however fine on a spatial scale, will be transferred into corresponding 
variations of density according to the prescription implied by the H&D 
curve of the emulsion. In practice, one finds that when the spatial scale 
of the exposure variations is too small, there may be no corresponding 
changes of density produced in the final transparency. We can say in very 
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Figure 7-11 Transmittance-exposure curves for 
high-gamma and low-gamma films. 
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general terms, then, that a given film must have a limited frequency 
response. 

As seen previously, film generally behaves in a very nonlinear fashion, 
and it is therefore certainly not proper to associate a transfer function 
with the film mapping. However, the physical phenomena that limit the 
frequency response are, for the most part, linear (e.g., scattering of light 
in the emulsion during exposure and chemical diffusion during develop- 
ment). Thus it might be hoped that the linear phenomena which limit 
frequency response could be separated from the nonlinear properties. 
Such an approach is, in fact, quite accurate in most cases. Thus it is 
generally proper to regard the photographic mapping as a cascade of a 
linear space-invariant filter having a certain transfer function, followed 
by a so-called memoryless nonlinearity which has all the properties 
discussed in the preceding sections.! 

To measure the detailed characteristics of the linear filter associated 
with the film, a cosinusoidal exposure 


E = Ey + E; cos 2nfx (7-13) 


may be applied. The ‘‘modulation”’ associated with this exposure is defined 
as the ratio of the peak variations to the background level, that is, 


ws 


M; = 
Ey 


(7-14) 
If the variations of density in the resulting transparency are measured, 
they can be referred back through the known H&D curve to yield an 
inferred or “‘effective’’ cosinusoidal exposure distribution, as indicated in 
Fig. 7-12. The modulation Met of the effective exposure distribution will 
always be less than the modulation M; of the true exposure distribution. 
Accordingly, the modulation transfer function of the film is defined as 


M(f) 
M x(f) 


where the dependence on the spatial frequency of the exposure has been 
emphasized. In addition, depending on the detailed shape of the point- 
spread function or impulse response of the linear portion of the film 
mapping, the effective exposure distribution may suffer a phase shift 
Q(f) with respect to the true exposure distribution. Thus the frequency 


MG)i= 


1 A more accurate representation is the cascade of a linear filter, a memoryless 
nonlinearity, and a second linear filter. However, for our purposes the more simple 
representation described above will suffice. 
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Figure 7-12 Measurement of the effective exposure distridution. 


response of the linear portion of the film mapping may be described by 
the complex transfer function 


M(f) = M(f) exp [—jQ(f)] 


The effective exposure distribution applied to the nonlinear portion of 
the film mapping may therefore be written 


E = E, + M(f)Fi cos [2rfx — 2(f)] (7-15) 


Figure 7-13 illustrates the typical frequency dependence of the MTF 
of a film. The range of frequencies over which significant response is 
obtained varies widely from film to film, depending on grain size, emulsion 
thickness, and other factors. By way of illustration, Plus-X film has 
significant response out to about 50 lines/mm, while that for Kodak 649F 
spectroscopic plate extends to at least 2,500 lines/mm. 


7-3 INCOHERENT PROCESSING SYSTEMS BASED ON 
GEOMETRICAL OPTICS 


An important class of optical processing systems can be characterized 
by a complete reliance on geometrical-optics reasoning for design. Such 
systems have, for the most part, been constructed for use with incoherent 
illumination. They are comparatively simple in their construction and 
operation, but as we shall see, this simplicity is sometimes achieved at 
the price of a limited data-handling capacity. The systems discussed 
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Figure 7-13 Typical modulation transfer function 


of film. 


below should be regarded only as examples of a particular design 
philosophy. 


Image casting 


One of the most straightforward of optical processing systems can be 
designed using simple image-casting properties of lenses. Such a system 
was proposed as early as 1927 by Emanuel Goldberg of Dresden, Germany, 
in a German patent application. Goldberg, who was granted a U.S. patent 
in 1931 [Ref. 7-19], fully recognized the potential application of his 
invention in the field we now call character recognition. 

The principles underlying the image-casting system are straightfor- 
ward. If a transparency with intensity transmittance 71 is imaged onto a 
second transparency with transmittance 72, then the intensity at each 
point behind the second transparency is proportional to the product 717». 
A photodetector can be used to measure the total intensity transmitted 
through the pair of transparencies, yielding a photocurrent J that is 
given by! 


eae il 71(x,y)T2(z,y) dx dy (7-16) 


Two means of achieving this operation are illustrated in Fig. 7-14. 


1 In writing infinite limits of integration we have assumed that the finite extents 
of the transparencies are included in the definitions of the 7’s. 
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For the technique (a), the lens 1; casts a magnified image of the uniform 
incoherent source S onto the two transparencies which are placed in 
direct contact. The lens ZL. then casts a demagnified image of the light 
transmitted by rz onto the small photodetector D. The photocurrent is 
then given by Eq. (7-16). 

If one or both of the transparencies must be changed rapidly, there 
are mechanical difficulties associated with the close contact of the 
transparencies in the previous configuration. It is then advantageous to 
separate the two transparencies physically, as illustrated in Fig. 7-14b. 
The lens L; again casts a magnified image of the source onto 7;. Lens Le 
images 7; onto 72, and lens L3 casts a demagnified image of the light 
transmitted by 72 onto the detector. Note that the transparency 7; must 
be inserted in a reflected geometry to compensate for the inversion 


T1(2, y) 


\ fx 
p= 


(a) 


ix Ly Ls 


(b) 
Figure 7-14 Systems for realizing the integral of a product. 
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introduced by the imaging operation of LZ». Again the photocurrent is 
given by Eq. (7-16). 

While the operation represented above is a useful one in a number of 
applications, including character recognition, it is often desirable to 
realize the related but more general operation of convolution. A one- 
dimensional convolution of the functions 7; and 72 can be realized with 
either of the above systems by moving one of the transparencies with 
uniform velocity and measuring the photodetector response as a function 
of time. More specifically, with reference to Fig. 7-140, let the trans- 
parency 72 be introduced in a reflected geometry such that the operation 
(7-16) becomes 


easy ful 71(X,Y)T2(—2,—y) dx dy 


If 7. is moved in the negative x direction with velocity v, the detector 
response is given by 


IQ) =k ff r(xy)ra(ot — 2, —y) de dy 


If the scans in the zx direction are repeated sequentially, each for a 
different y displacement —y,,, then the detector responses are 


Ta = k i rier — vay. — 4) dade meal 2) 3 ee 


The array of functions [,,(¢t) represents a full two-dimensional convolu- 
tion, albeit sampled in the y displacement. 


Convolution without motion 


The preceding technique for performing convolutions is extremely 
awkward and time-consuming due to the mechanical scanning required. 
It is possible to perform the same operation without relative motions if 
the optical configuration is modified [Ref. 7-20]. Referring to Fig. 7-15, 
let the distributed source S be placed in the front focal plane of the lens 
Ly. Immediately behind Ly is placed the transparency with transmittance 
71(—a,—y). At a distance d from 7; and immediately in front of lens Ls, 
the transparency 72(+2,+y) appears. The intensity distribution across 
the back focal plane of Lz is then measured, perhaps with film, although 
the use of an electronic detector (such as a vidicon) is also possible. 

To understand the operation of this system, consider first the light 
generated by the particular source point with coordinates (—z,,—y;). 
The rays from that point emerge from L; (and 7;) parallel with each other 
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Figure 7-15 System for performing convolution without motion. 


and illuminate 72. with an intensity distribution proportional to 
mi[—2 + (d/f)x., —y + (d/f)y,]. After passing through r. the rays are 
focused onto the detector at coordinates (2,,y;), Where we have assumed 
that the two lenses have identical focal lengths. Thus the intensity 
distribution across the detector may be written 


y d d 
I(@a = Xs, Ya = Ys) = JJ 71 € Be — Bye — ) t2(x,y) dx dy (7-18) 


which is the desired convolution. 


Impulse-response synthesis with a misfocused system 


Direct synthesis of a desired impulse response is possible by means of 
the ‘‘misfocused”’ system illustrated in Fig. 7-16! [Ref. 7-21]. The lens Li 
again images the distributed source S onto the “input” transparency 


1 This system is in fact nearly identical with that of Fig. 7-15, but the point of 
view is sufficiently different to warrant a separate discussion. 
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Figure 7-16 Impulse-response synthesis with a misfocused 
system. 
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with transmittance 71. The lens L2 forms an image of 7; across the plane 
P’. For simplicity we assume that 71 and P’ are each a distance 2f from 
the lens Ls, thus yielding a magnification of unity. The transparency 72, 
having the form of the desired impulse response, is inserted directly 
against L2; the system output is found across the plane P, located a 
distance A from the image plane P’. 

The operation of this system is most easily understood by applying 
a unit-intensity point source at coordinates (x,y) on 7; and finding the 
resulting intensity distribution across P. In the geometrical-optics 
approximation, the rays passing through 72 converge to an ideal point in 
plane P’, and then diverge to form a demagnified projection of 72 in 
plane P. The projection is centered at coordinates {zz = —[1 + (A/2f)]z, 
ya = —(1 + (4/2f)]y} and the demagnification of r2 is A/2f. Taking into 
account the reflection of rz when projected, the response to the point 
source may be written 


|h(xa,ya3x,y)|? = kre {- 2 E =" (1 se =)? |, 2 E st: (1+ =) v|| 


The intensity at output coordinates (—zxa,—ya) can thus be written as 
the convolution integral 


To, Va) 

= [f mead (2 | -(14: =) |, fue (1+ S)u]| ae ay 
(7-19) 

Limitations 


Incoherent processing systems based on geometrical optics have a common 
limitation: because of the incoherent nature of the illumination, the 
input data and impulse response are restricted to being nonnegative 
(intensity) distributions. While techniques for realizing the equivalent 
of a negative intensity distribution do exist (see, for example, Ref. 7-21), 
they are usually cumbersome and inconvenient. Thus there is no simple 
and direct way of processing bipolar inputs with bipolar impulse responses 
when incoherent illumination is used. 

In addition, all systems designed on the basis of geometrical optics 
must satisfy a common constraint: the geometry of the system must be 
chosen in such a way that diffraction effects are entirely negligible. 
This requirement is readily satisfied by the system of Fig. 7-14a, and is 
not difficult to meet with the system of Fig. 7-14b. However, the systems 
of Figs. 7-15 and 7-16 can be highly deficient in this respect, as will be 
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evident from the discussion that follows. Referring to Prob. 2-12, a 
convenient measure of the complexity of a given function is its space- 
bandwidth product. This product represents, to a good approximation, 
the number of independent data points that are contained by the function 
in question. In order to obtain the highest possible efficiency in a data- 
processing system, that system should be designed to process input 
functions of the largest possible space-bandwidth product. 

Consider the processing system of Fig. 7-15. To maximize the space- 
bandwidth product of the system, we would attempt to place as many 
independent data points on the transparencies as possible. But as the 
spatial structure on the transparencies becomes finer, more and more of 
the light passing through the first transparency is diffracted, with less and 
less of the light obeying the laws of geometrical optics. Thus the method 
by which this system was analyzed becomes less and less accurate, and 
the output departs from the desired result. 

Similar arguments apply for the system of Fig. 7-16. As the structure 
on the second transparency becomes finer and finer, the light incident on 
plane P from a point source on 7, will resemble a geometrical projection 
of rz less and less. Thus the method of analysis by which the system 
performance is predicted is not accurate when the space-bandwidth 
product of the transparencies becomes high. 

While we have considered only specific examples, a fundamental fact 
is clear: if large quantities of data are to be squeezed into an aperture of a 
given size, ultimately diffraction effects must be taken into account. It 
is therefore important to be sure that when a system is designed on the 
basis of geometrical optics, it is used in a way which assures the accuracy 
of the laws of geometrical optics. 


7-4 FREQUENCY-DOMAIN SYNTHESIS 


It is also possible to synthesize desired linear-filtering operations in the 
frequency domain rather than the space domain. When coherent illumina- 
tion is used, the desired filtering operation can be synthesized by direct 
manipulation of the amplitude transmittance across the back focal plane 
of a transforming lens. Examples of this type of processing system have 
already been seen in the discussions of filtering of photographs (Maréchal) 
and of phase-contrast microscopy (Zernike). 

When incoherent illumination is used, frequency-domain synthesis is 
still possible, for there exists a simple autocorrelation-function relation- 
ship between the amplitude transmittance of the pupil and the optical 
transfer function of the system. However, the incoherent system has two 
distinct shortcomings. First, the class of transfer functions that can be 
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directly synthesized is limited to the class of all autocorrelation functions; 
equivalently, only nonnegative real impulse responses can be realized. 
In many practical applications, more general transfer functions are 
required. Second, even if the desired transfer function does happen to 
correspond to a nonnegative real impulse response, the pupil function 
required is not unique and there is no known systematic procedure for 
finding the simplest pupil function that will yield the desired transfer 
function. 

For these reasons, attention will be restricted to frequency-domain 
processing with coherent illumination. This is not to imply that fre- 
quency-domain processing is never of interest when the illumination is 
incoherent. For an example of an incoherent! system in which frequency- 
domain synthesis is used, see Ref. 7-22. 


Coherent-system configurations 


Coherent systems, being linear in complex amplitude, are capable of 
realizing operations of the form 


Tey) =K| ff oenhe—&y—n)dtenf (7-20) 


Two possible configurations for realizing this type of operation are 
illustrated in Fig. 7-17. For system (a), light from the point source S is 
collimated by lens Li. The input to be filtered is inserted as a space- 
varying amplitude transmittance g(x1,y1) in plane P;. Lens L. Fourier 
transforms? g, producing an amplitude distribution kiG(x2/Af, y2/Af) 
across P2, where G is the Fourier transform of g and k, is a complex 
constant. A filter is inserted in this plane to manipulate the amplitude and 
phase of the spectrum G. If H represents the Fourier transform of h, then 
the amplitude transmittance of the frequency-plane filter should be 


t(v2,y2) = koH (2 2) (7-21) 


The amplitude distribution behind the filter is thus proportional to GH. 
Finally, the lens Ls; transforms this amplitude distribution to yield an 


1 This system also uses coherent light in an initial step, but only to cast the inputs 
into a specific form that allows the desired operations to be realized with incoherent 
illumination. 

2 It is, of course, necessary to choose the size of the lens sufficiently large to assure 
that vignetting effects are negligible (see Prob. 5-4). 


SPATIAL FILTERING AND OPTICAL INFORMATION PROCESSING / 167 


ahaa Me Shear cea cis 2 oc 
(b) 


Figure 7-17 Two possible configurations for coherent processing 
systems. 


intensity distribution 
Ieuys) = K| ff a(en)h(—2s — &, —ys — 0) dé dn 


across P;. The negative signs preceding x; and y; are a consequence of 
the reflection of the output introduced by the sequence of two Fourier 
transformations, rather than Fourier transformation followed by inverse 
Fourier transformation (cf. Prob. 2-3a). This awkwardness can be 
removed by redefining the (x3,y3;) coordinate system in a reflected 
geometry, as has been illustrated in Fig. 7-17a. Such a convention, which 
will be adopted hereafter, allows the output intensity distribution to be 
expressed as 


I(x3,Ys) = K| ef g(E,n)h(xs ane é, US n) dé dn : (7-22) 


Figure 7-18 shows a photograph of an optical processing system of the 
type described above. 
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Figure 7-18 Photograph of a coherent processing system. S, laser source; P, pinhole 
filter; Ly, collimating lens; Gi, liquid gate (input plane); Lo, trans- 
forming lens; G2, liquid gate (frequency plane); L3, transforming lens; 
F, film holder (output plane). 


A second configuration for performing the same operation is shown 
in Fig. 7-17b. The lens L, again collimates the light from the point source 
S. In this case the lens L2 performs both the Fourier transforming and 
the reimaging operations. The input transmittance g is applied in plane 
P,, which in this case is located a distance 2f in front of Lo.! Across the 
back focal plane of Lz appears the spectrum G, this time multiplied by the 
quadratic phase factor exp | -i5; (@_? + w)| since the input is not in 
the front focal plane [see Eq. (5-19)]. The frequency-plane filter with 
amplitude transmittance proportional to H is again inserted in P2. The 
plane P; is located at distance 2f behind the lens; thus the lens law is 
satisfied for planes P; and P3. Since the spectrum of the object has been 
multiplied by H, the intensity across P3 is again given by 


Iesys) = K| [f g(emh(wa — & ya — 0) dé dn 


‘ The distance 2f is not essential. It is only required that the lens law be satisfied 
for planes P; and P3. 
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There are two practical disadvantages attendant with the second 
configuration. First, as compared with system (a), the input for system (b) 
is twice the distance from the transforming lens Ly. This means that, for 
a given focal length and input size, L, must be larger for system (b) than 
for system (a) to assure that vignetting effects are negligible. A second 
disadvantage arises from the considerations that led to the approximation 
(5-29) in the analysis of the coherent imaging properties of a lens. In that 
formulation, we found it necessary to assume that the amplitude of the 
image at any particular point consisted of contributions from the object 
amplitude in only a small region surrounding the geometrical object 
point. If the filtering operation represented by H is of high space-band- 
width product, the impulse response h will extend over a sizable area and 
the output of system (b) must then be regarded as a filtered version of the 
function g(x1,y1) exp E = (a? + w)| rather than simply g(«1,y1). This 
problem is not encountered with system (a), which casts an image of a 
plane (P) onto a plane (P3), rather than of a sphere onto a sphere. 

Finally, it should be emphasized that coherent systems are often 
arranged to process a multitude of functions of one independent variable, 
rather than a single function of two independent variables. An example 
of these so-called astigmatic processors is shown in Fig. 7-19. The col- 
limating lens 1; is followed by the input data in plane P;. The input data 
consist of a vertical array of one-dimensional transmittance functions 
each running horizontally. A cylindrical lens Lz follows, placed one focal 
length f from P; and having power in the vertical dimension. At distance 
2f beyond L+ is placed a spherical lens L3, which again has focal length f. 
The ‘frequency plane” now appears at P2. The lens combination Le,L; 
has performed a double Fourier transformation in the y direction, thus 
imaging the vertical dimension. Since L2 exerts no power in the z direc- 
tion, the spherical lens L; Fourier transforms the horizontal dimension, 


Ls P3; 


kK fotx—2 mle fob fofeoy—ofe jo] 


Figure 7-19 Example of an astigmatic processor. 
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k : 
up to a phase factor exp (=i n) across P». This phase factor can be 


removed by placing a negative cylindrical lens of focal length f/2 
immediately in front of P:, thus introducing the multiplicative phase 
transformation 


ti(z2) = exp (i; x) (7-23) 


which cancels the phase curvature. If the input array is the set of trans- 
mittance functions gz(r1), k = 1, 2,..., K, then across P2 we find 
displayed the corresponding set of transforms G;(r2),k = 1,2, ... , Ge 
with the vertical order inverted by the imaging operation. 

A linear array of one-dimensional filters may now be introduced in 
plane P2. The lens pair Z;,L5 again images in the y dimension and Fourier 
transforms in the x dimension, thus retaining the array structure but 
returning the individual functions to the “space domain.” The phase 
factor associated with the final Fourier transformation is generally of 
no concern. 


Limitations 


While coherent systems are in general more flexible and have greater 
data-handling capacity than incoherent systems, nonetheless there are 
certain limitations to the types of operations that can be realized. We 
consider here the limitations associated with systems that achieve a 
desired transfer function by independent control of the amplitude and 
phase transmittances in the frequency plane. More sophisticated tech- 
niques for realizing the frequency-plane masks, based on interferometric 
recording, are free from some of these limitations, as will be discussed 
in Sec. 7-5. 

The traditional means for realizing a given transfer function has 
been by the insertion of independent amplitude and phase masks in the 
frequency plane. The amplitude transmittance is conveniently controlled 
by a photographic plate, presumably immersed in a liquid gate. The phase 
transmittance may be controlled by insertion of a transparent plate with 
an appropriately varying thickness. In general, it is difficult to achieve 
more than two levels of phase control (that is, 0 or 180° relative delay) 
by this technique. The phase-shifting plates may be ruled on a substrate 
in much the same way that diffraction gratings are ruled, or they may be 
realized by evaporation techniques or film-reliefing techniques. All such 
methods are cumbersome and can be successfully employed only when 
the desired pattern of binary phase control is relatively simple. Thus 
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there is a definite practical limitation that restricts realization to rather 
simple transfer functions. It should be noted that, even if a very simple 
impulse response is required (e.g., the character ‘“‘P’’), the corresponding 
transfer function may be (1) difficult to calculate analytically in advance, 
and (2) far too complicated to be synthesized by these techniques. 

Finally, if input functions of large space-bandwidth product are to 
be filtered with an impulse response of large space-bandwidth product, 
alignment of the filter mask in the frequency plane may require extremely 
precise positioning accuracy (see Prob. 7-6). However, this problem can 
usually be overcome by the use of accurate micropositioners. 

In summary, then, the most severe limitation to the traditional 
coherent processor arises from the difficulty of simultaneously controlling 
the amplitude and phase in the frequency plane in any but a simple 
pattern. It was not until 1963, with the origination of interferometrically 
recorded frequency-plane filters, that this serious limitation was, to a large 
extent, overcome. This more recent advance in the field of optical process- 
ing is the subject of the section to follow. 


7-5 THE VANDER LUGT FILTER 


In 1963, A. B. Vander Lugt of the University of Michigan’s Radar 
Laboratory proposed and demonstrated a new technique for synthesizing 
frequency-plane masks for coherent processors [Refs. 7-23, 7-24]. The 
masks synthesized by this technique have the remarkable property that 
they can effectively control both the amplitude and phase of a transfer 
function in spite of the fact that they consist solely of patterns of absorp- 
tion.! By means of these techniques it is possible to overcome the two 
most serious limitations of traditional coherent processing systems. 


2 


Synthesis of fhe frequency-plane mask 


The frequency-plane mask for a Vander Lugt filter is synthesized with 
the help of an interferometric system, such as that illustrated in Fig. 7-20. 
The lens 1; collimates the light from the point source S. A portion of this 
light strikes the mask Pi, which has an amplitude transmittance that is 
equal to the desired impulse response h. The lens Ly Fourier transforms 
the amplitude distribution h, yielding an amplitude distribution 


1 Historically, the Vander Lugt filter was preceded by a very similar technique, 
now known as the hard-clipped filter. For such filters the frequency-plane mask was 
computed digitally rather than recorded optically. While the hard-clipped filter was 
used in radar data processing as early as 1961, due to classification it did not appear 
in the open literature until 1965 [Ref. 7-25]. 
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response 


Figure7-20 Recording the frequency-plane mask for a Vander Lugt filter. 


ee H (= “) incident on the film. In addition, a second portion of the 
Af \Af AF 
collimated light passes above the mask P;, strikes a prism P, and is 
finally incident on the film at an angle 6 as shown. 

The total intensity incident at each point on the film is determined 
by the interference of the two amplitude distributions present. The 
tilted plane wave incident from the prism produces a field distribution 


U,(x2,Y2) = T exp (—j2raye) (7-24) 
where the spatial frequency a@ is given by 
sin 6 


= =) 
a i (7-25) 


The total intensity distribution may therefore be written 


= 72 + : H o H ae aye (= 28) exp nei 


Te Yo 
ae te ) exp (—j2may2) (7-26) 


Note that if the complex function H has an amplitude distribution 
A and a phase distribution y, that is, if 


be Ye es T2 Yo 2 Y2 
: G =) . ( z a |- he (% a 
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then the expression for J can be rewritten in the form 


il V2 Y2 
g a ee ee Le 
(12,42) r rf? A @ .) 


vu Gat x 
tara Gex)ee Perv a] 29 
This form illustrates the means by which the interferometric process 
allows the recording of a complex function H on an intensity-sensitive 
detector: the amplitude and phase information is recorded, respectively, 
as amplitude and phase modulations of a high-frequency carrier that is 
introduced by the tilted ‘“‘reference’”’ wave from the prism. 

There are, of course, other optical systems that will produce the 
same intensity distribution as that of (7-27). Figure 7-21 illustrates two 
additional possibilities. System (a) consists of a modified Mach-Zehnder 
interferometer. By tilting the mirror M,, a tilted plane wave is produced 
at plane P2. In the lower arm of the interferometer, the lens Lz again 
Fourier transforms the desired impulse response. The final beam splitter 
allows the addition of these two waves at the film plane. 

System (b), which consists of a modified Rayleigh interferometer, 
provides a third means for producing the same intensity distribution 
[Ref. 7-2, p. 128]. The collimating lens LZ, is followed by a smaller lens Lz, 
which focuses a portion of the collimated light to a bright spot in the 
front focal plane of lens L3. When the spherical wave generated by this 
“reference point”? passes through Zs, it is collimated to produce a tilted 
plane wave at the film plane. The amplitude transmitted by the impulse- 
response mask is Fourier transformed in the usual fashion. Thus an 
intensity distribution similar to Eq. (7-27) is again produced at the 
recording plane. 

As a final step in the synthesis of the frequency-plane mask, the 
exposed film is developed to produce a transparency which has an ampli- 
tude transmittance that is proportional to the intensity distribution 
incident during exposure. Thus 


pe eee oH exp Goran) 
t(©2,Y2) & To X22 Wie 


+ H* exp (—j2may2) (7-28) 


Note that, aside from the simple complex-exponential factor, the third 
term of the transmittance is proportional to H and therefore has exactly 
the form required to synthesize a filter with impulse response h. It remains 
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(b) 
Figure 7-21 Two alternative systems for producing the frequency-plane 


mask. (a) Modified Mach-Zehnder interferometer; (6) 
modified Rayleigh interferometer. 


to be demonstrated how that particular term of the transmittance can 
be utilized and the other terms excluded. 


Processing the input data 


Once the frequency-plane mask has been synthesized, it may be inserted 
in the usual processing system illustrated previously in Fig. 7-17a. If 
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the input function to be filtered is g(1,y1), then incident on the frequency- 
plane mask is an amplitude distribution given by wy G E 2) hes 


Af \Af AS 
field strength transmitted by the mask then obeys the proportionality 
Pa) 
EY, or HIG + ia ht 


36 = H*G exp (—j2ray2) 


The final lens L3 of Fig. 7-17a Fourier transforms U, (and multiplies 


by 1/\f). Taking note of the reflected coordinate system in P3, the field 
strength in that plane is found to obey the proportionality 


Us(x3,y3) < ro’g(r3,y3) + <> [h(as,ys) * h*(—a3, ys) * g(axa,ys)] 


ae 
+ vp hea) * 9(X3,Y3) * 6(r3,ys + adf)] 
au vlh *(—23,—Ysa) * G(r3,Ys) * 5(x3,ys — adf)] (7-29) 


The third term of this expression is of particular interest. Noting that 


h(xs,ys) * g(X3,Ys) * 6(x3,ys + af) 


os // h(zs — £, ys + adf — n)g(én) dé dn (7-80) 


this portion of the output is seen to yield the convolution of h and g 
centered at coordinates (0,—aAf) in the x3y; plane. 
For future reference, the fourth term of Eq. (7-29) may be rewritten 


h*(—23,—ys) * g(s,Ys) * 5(%3,ys — af) 


= [ff algmb*(e — 25,1 — ys + adf) dé dy (7-31) 


which is the crosscorrelation of g and h centered at coordinates (0,aAf) 
in the x3y3 plane. 

Note that the first and second terms of Eq. (7-29), which are of no 
particular utility in the usual filtering operations, are centered at the 
origin of the z3y3 plane. Thus it is clear that if the “carrier frequency” a 
is chosen sufficiently high, or equivalently, if the reference wave is 
introduced at a sufficiently steep angle, the convolution and crosscorrela- 
tion terms will be deflected (in opposite directions) sufficiently far off-axis 
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to be viewed independently. To find the convolution of h and g, the 
observer simply examines the distribution of light centered about coor- 
dinates (0,—aaf). To find the crosscorrelation of h and g, the observation 
is centered at coordinates (0,aAf). 

To illustrate the requirements placed on a more precisely, consider 
the widths of the various output terms illustrated in Fig. 7-22. If the 
maximum width of h in the y direction is W;, and that of g is W,, then the 
widths of the various output terms are as follows: 


Geely HAG 


2. a [h(x3,y3) *h*(—2x3,—Yys) * g(r3,y3)] ~ 2W, + W, 
3. Te th(vsus) # a(aas) * Blzoys + adi] Wa + We 


4. sg lh*(— as, — Us) * g(X3,Ys) = 5(X3,Ys bas adf)] a Wh az W, 


From the figure it is clear that complete separation will be achieved if 


LF? 
or equivalently, if 
3, OW; 
Oa (7-32) 
2 f "3 


where the small-angle approximation sin 6 & 6 has been used. 


Advantages of the Vander Lugft filter 


The use of a Vander Lugt filter removes the two most serious limitations 
to conventional coherent processors. First, when a specified impulse 
response is desired, the mathematically complicated and laborious task 
of finding the required transfer function is eliminated; the impulse 
response 1s Fourier transformed optically by the system that synthesizes 
the frequency-plane mask. Second, the generally complicated complex- 
valued transfer function is synthesized with a single absorbing mask; the 
phase transmittance through the frequency plane need no longer be 
controlled in a complicated manner. The absorbing mask is simply 
immersed in a liquid gate to eliminate all relative phase shifts. 

The Vander Lugt filter remains very sensitive to the exact position 
of the frequency-plane mask, but no more sensitive than the conventional 
coherent processor. The recording of the modulated high-frequency carrier 
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Figure 7-22 Location of the various terms of the processor 
output. 


Y3 


requires a higher-resolution film than might ordinarily be used to syn- 
thesize the mask, but films with adequate resolution are available (e.g., 
Kodak Spectroscopic Plates) and this requirement poses no particular 
problem. 

Note that the Vander Lugt technique offers an important new flexi- 
bility to coherent processing. Whereas previously the realization of the 
frequency-plane mask was the major practical problem, the difficulties 
are now transferred back to the space domain. The difficulties are in 
general much less severe in the space domain, for the impulse responses 
required are often simple, and the necessary masks can be constructed by 
conventional photographic techniques. Thus the Vander Lugt filter has 
extended the use of coherent processors to a new and previously unattain- 
able realm of operations. Many of the most promising applications of 
coherent processing fall in this new realm. 


7-6 APPLICATION TO CHARACTER RECOGNITION 


A particular application of optical processing that has been of interest 
for many years is found in the field of character recognition. As we shall see, 
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this application affords an excellent example of desired processing opera- 
tions with simple impulse responses but not necessarily simple transfer 
functions. The Vander Lugt synthesis technique is therefore particularly 
well suited for this application. 


The matched filter 


The concept of a matched filter plays an important role in character- 
recognition problems. By way of definition, a linear space-invariant filter 
is said to be matched to the particular signal s(a,y) if its impulse response 
h(z,y) is given by 

h(z,y) = s*(—2,—y) (7-33) 


If an input g(z,y) is applied to a filter matched to s(x,y), then the output 
v(z,y) is found to be 


v(oy) = ff hoe — & y = ng (En) dé dn 


ee ff g(én) s*(é — 2,4 — y) dé dy (7-34) 


which is recognized to be the crosscorrelation function of g and s. 

Historically, the concept of a matched filter first arose in the field of 
signal detection; if a signal of known form, buried in ‘‘white’’ noise, is to 
be detected, then a matched filter provides the linear operation which 
maximizes the ratio of instantaneous signal power (at a particular time) 
to average noise power [Ref. 7-26]. However, in the present application, 
the characters will be assumed noiseless, and the use of a particular 
filtering operation must be justified on other grounds. 

Considerable insight into the nature of the matched-filtering opera- 
tion is provided by an optical interpretation, as illustrated in Fig. 7-23. 
Suppose that a filter, matched to the input signal s(z,y), is to be synthe- 
sized by means of a frequency-plane mask in the usual coherent processing 
geometry. Fourier transformation of the impulse response (7-33) shows 
that the required transfer function is 


H(fx,fy) = S*(fx,fy) (7-35) 


where H = S{h} and S = ${s}. Thus the frequency-plane mask should 
have an amplitude transmittance proportional to $*. 

Consider now the particular nature of the field distribution trans- 
mitted by the mask when the signal s (to which the filter is matched) is 
present at the input. Incident on the mask is a field distribution propor- 
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Figure 7-23 Optical interpretation of the matched- 
jiltering operation. 
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tional to S, and transmitted by the mask is a field distribution propor- 
tional to SS*. This latter quantity is entirely real, which implies that the 
frequency-plane mask exactly cancels all the curvature of the incident 
wavefront S. Thus the transmitted field distribution is a plane wave, 
which is brought to a bright focus by the final transforming lens. When 
an input signal other than s(z,y) is present, the wavefront curvature will 
in general not be canceled by the frequency-plane mask, and the trans- 
mitted light will not be brought to a bright focus by the final lens. Thus 
the presence of the signal s can conceivably be detected by measuring the 
intensity of light at the focal point of the final transforming lens. (If the 
input s is not centered at the origin, the bright point in the output plane 
simply shifts by a distance equal to the misregistration distance; cf. 
Prob. 7-7.) 


A character-recognition problem 


Consider the following specific character-recognition problem: The input 
g to a processing system may consist of any one of N possible characters 
Si, So, . - . , Sy, and the particular character present is to be determined 
by the processor. As will now be demonstrated, the identification process 
can be realized by applying the input to a bank of WN filters, each matched 
to one of the possible input characters. 

A block diagram for the recognition machine is shown in Fig. 7-24. 
The input is simultaneously (or sequentially) applied to the N matched 
filters with transfer functions Sf, Sf, ... , Sy. The response of each 
filter is normalized by the square root of the total energy in the character 
to which it is matched. This normalization, which can be accomplished 
electronically after detection of the filter outputs, takes account of the 
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Figure 7-24 Block diagram of a character-recognition system. 


fact that the various input characters will generally not be of equal 
energy. Finally, the squared moduli of the outputs |v1|?, |v2|?, . . . , |vwl? 
are compared at the particular points where their maximum outputs would 
be anticipated (assuming that the character to which they are matched 
is present in each case). As will now be demonstrated, if the particular 
character 


g(x,y) = sx(x,y) 


is actually present at the input, then the particular output |v;|? will be 
the largest of the N responses. 


To prove this assertion, first note that, from Eq. (7-34), the peak 
output |v,|? of the correct matched filter is given by 


[ ffiwitasanf 


Mi} ae ae flee (7-36) 


ffmcaa= 2 
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On the other hand, the response |v,|? (n # k) of an incorrect matched 


filter is given by 
| ff s.8* dé dy e 
eo ——— (7-37) 


i Isal? dE dy 


But, using Schwarz’ inequality, we have 


| ff ost deta) < ff ta adn ff oa ae 


It follows directly that 


Ival? < fff |sel® dé dy = vel? (7-38) 


with equality if and only if 


Sn(x,y) a KSx(2,Y) 


From this result it is evident that the matched filter does provide a 
means of recognizing which character, of a possible set of characters, is 
actually being presented to the system. It should be emphasized that this 
capability is not unique to the matched filter. In fact, in some cases it is 
possible to modify (i.e., mismatch) all the filters in such a way that the 
discrimination between characters is improved (see, for example, Ref. 
7-2, pp. 1380-133). 


Optical synthesis of a character-recognition machine 


The matched-filter operation can readily be synthesized by means of the 
Vander Lugt technique discussed previously. This is particularly simple 
since it will be recalled that one of the outputs of the Vander Lugt filter 
is itself the crosscorrelation of the input with the original character from 
which the filter was synthesized. By restricting attention to the proper 
region of the output space, the matched-filter output is readily observed. 

Figure 7-25a shows a photograph of the impulse response of a Vander 
Lugt filter which has been synthesized for the character ‘‘P.’”’ The upper 
portion of the response will generate the convolution of the input data 
with the symbol P, while the lower response will generate the cross- 
correlation of the input with the letter P. The central portion of the 
response is undesired and not of interest. 
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Figure 7-25 Photographs of (a) the impulse response of a Vander Lugt filter, and 
(b) the response of the matched-filter portion of the output to the letters 
Q, W, and P. 


Figure 7-25b shows the response of the matched-filter portion of the 
output to the letters Q, W, and P. Note the presence of the bright point 
of light in the response to P, indicating the high correlation between the 
input and the letter to which the filter is matched. 

To realize the bank of matched filters illustrated in Fig. 7-24, it 
would be possible to synthesize N separate Vander Lugt filters, applying 
the input to each filter sequentially. Alternatively, if N is not too large, 
it is possible to synthesize the entire bank of filters on a single frequency- 
plane mask. This can be done by frequency multiplexing, or recording 
the various frequency-plane masks with different carrier frequencies on 
a single transparency. Figure 7-26a illustrates one way of recording the 
multiplexed filter. The letters Q, W, and P are at different angles with 
respect to the reference, and as a consequence, the crosscorrelations of 
Q, W, and P with the input character appear at different distances from 
the origin, as illustrated in Fig. 7-26b. 
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The number of different filters that can be realized by this technique 
is limited by the dynamic range that can be achieved in the frequency- 
plane mask. Synthesis of nine separate impulse responses with a single 
mask has been demonstrated by Vander Lugt [in Ref. 7-2, pp. 133-139]. 


Sensitivity to scale size and rotation 


The coherent optical character-recognition technique suffers from 
deficiencies which are shared by all matched-filter approaches to the 
character-recognition problem. Specifically, the filters are generally rather 
sensitive to scale-size changes and rotations of the input characters. 
When the input character is presented with an improper angular orienta- 
tion or improper magnification, the response of the correct matched filter 


Reference 
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Crosscorrelation 
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Crosscorrelation 
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Figure 7-26 Synthesis of a bank of matched filters with a single 
frequency-plane mask. (a) Synthesizing the frequency- 
plane mask; (b) format of the matched-filter portion of 
the output. 
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Figure 7-27 Sidelooking radar geometry. 


is reduced, and errors arise in the character-identification process. The 
degree of sensitivity to rotation and scale size depends to a large extent 
on the structure of each character (e.g., an ‘‘Z”’ is more rotation-sensitive 
than an ‘‘O’’), but it can be controlled to some extent by clever filtering 
operations that reduce this sensitivity. Often a reasonable solution is to 
perform a mechanical search through various scale sizes and orientations. 
Such operations are time-consuming, however. For a more detailed 
discussion of these problems see Ref. 7-2, Chap. 7. 


7-7 APPLICATION TO PROCESSING SYNTHETIC-APERTURE 
ANTENNA DATA 


One of the most successful applications of the optical information-process- 
ing technology has been in the formation of high-resolution terrain maps 
from data collected by so-called synthetic-aperture radars. The entire 
discussion that follows is based on a paper by Cutrona et al. [Ref. 7-11], 
to which the reader is referred for more details. 


Formation of the synthetic aperture 


With reference to Fig. 7-27, consider a radar system carried by an aircraft 
flying with uniform velocity v, along a linear flight path in the z direction. 
Suppose that the function of the radar is to obtain a high-resolution map 
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of the terrain reflectivity across an area adjacent to the flight path. 
Resolution in range from the flight path is obtained by transmitting 
pulsed radar signals and recording the received returns as a function of 
time. Resolution in azimuth could in principle be obtained by using a 
radar beam of extremely narrow azimuthal extent. However, the azi- 
muthal resolution obtainable at range R by an antenna of linear extent 
D is roughly \,R/D. Since the microwave wavelength X, is orders of mag- 
nitude longer than optical wavelengths, antennas so large as to be 
impractical would be required to obtain resolutions comparable with those 
of photoreconnaissance systems. 

A solution to this problem is offered by the synthetic-aperture 
technique. Let the aircraft carry a small, broadbeam antenna which points 
in a fixed sidelooking direction with respect to the aircraft. Radar pulses 
are transmitted from a sequence of positions along the flight path, and the 
time records (amplitude and phase) of the radar returns observed at these 
positions are recorded. Each such record may be regarded as the signal 
that would be obtained from a single element of an array, and the various 
recorded waveforms need only be properly combined to synthesize an 
effective aperture that can be hundreds or thousands of meters long. 
Note that to realize the longest possible synthetic array, the radar antenna 
must illuminate a given terrain point for the longest possible portion of the 
flight path. Thus the broader the beamwidth of the radar antenna, the 
higher the resolution that can potentially be obtained from the received 
data. Achievement of the full resolution of the array further requires that 
a stable phase reference be available in the aircraft, enabling both the 
amplitude and the phase of the return to be recorded along the flight path. 


The collected data and the recording format 


To examine the signal-collecting process in more detail, consider first the 
geometry illustrated in Fig. 7-28. The distance along the flight path is 
represented by the coordinate x. For simplicity we assume that a simple 
point scatterer exists at coordinate x1, which lies at a perpendicular 
distance 7; from the flight path. For additional simplicity we assume that 
the waveform transmitted by the radar is a steady sinusoid of frequency 
f,. The pulsed nature of the actual transmitted signal results simply in a 
periodic sampling of the signal predicted under the sinusoidal assumption. 
Sampling theory (cf. Sec. 2-3) predicts that the discrete nature of the 
samples can be neglected for the purposes of azimuthal considerations, 
provided the distance traveled by the aircraft between samples is less 
than (2Bx)~!, where Bx is the spatial bandwidth of the terrain reflections. 
Consideration of the pulsed nature of the signal is, of course, essential 
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Figure 7-28 Flight-path geometry. 


for obtaining the other dimension of resolution—range from the flight 
path. 

The signal returned to the aircraft from the point scatterer under 
consideration can be represented by the time-varying phasor 


si(t) = 61 exp Ez (: - *)| (7-39) 


where f, is the RF frequency of the radar, r is the range from aircraft to 
target, c is the velocity of light, and o; is a complex amplitude factor 
which depends on such parameters as transmitted power, target reflec- 
tivity and phase shift, and inverse fourth-power propagation attenuation. 
The distance r may be expressed in terms of 11, x1, and zx <the flight-path 
coordinate) by 


i 2 
ae Gea (7-40) 
2r1 


yielding a received signal 


si(t) = o1(21,71) exp | j Ez = = es =< || (7-41) 


The motion of the aircraft links the space variable z to the time variable 
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¢ through the prescription 


XL = Vat 


If the terrain at range r; is regarded as consisting of a collection of 
many point scatterers, the total returned signal can be written 


s() = ) salt) 
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Arr, Qa 


= ) On(Xn,%1) CX ) | Qnf,t — 
d ( 0 . f | gh Ar ArT 1 
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The returned signal is synchronously demodulated, which amounts 
simply to translating the center frequency of the return from f, to a new 
lower frequency f., yielding 


Arr, Qa 


Ar Ar? 1 


of) i= > lon(Xn,71)| COS Ez — 


n 


(vat = aye =F 6.| (7-43) 


where we have here abandoned the phasor notation. 

The demodulated signal is used to intensity-modulate a cathode-ray 
tube, the electron beam being swept vertically during the time interval 
of interest after each transmitted pulse. If film is drawn past the scope 
face with horizontal velocity v;, the recording format shown in Fig. 7-29 
is obtained. The vertical lines represent successive range sweeps, while 
the azimuthal position of a given scatterer with respect to the radar 
varies along the horizontal dimension. 


Focal properties of the film transparency 


We again limit attention to a single range 7;, thus considering only the 
data recorded along a line y = y; on the film, and we again neglect the 
pulsed nature of the transmitted signal. With proper care in exposure, 
the azimuthal history of the received signal can be made to generate a 


<= film motion 
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intensity-modulated 
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Figure 7-29 Recording format. 
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photographic record with amplitude transmittance given by 
(nan) = t& +x) lon(ensrs)| 


2 
cos | arf by aw oe 6 x v2) + 6.| (7-44) 


dr ArT 1 Uy 


where t is a bias transmittance introduced to allow recording of the 
bipolar video signals, and x is simply a constant. In writing (7-44), use 
has been made of the relation! 


x = vyt 

from which it follows that 
Sik 
fx = 


By decomposing the cosine of Eq. (7-44) into two complex-expo- 
nential factors, the transmittance may be expressed as the sum of the 
bias ¢, and two additional terms 


x , “lo Qa Va 2 UF 2 
fee) = 5 » o,,(fn;%1) CXp 49 | 2afxe— es : xz Pte 


and (7-45a) 
=< a : 55) Se eae (eat (een ee 
te(2,y1) = 5 2 o, (Lnjy11) EXP | J Es E. (") (: ss ae 
(7-45b) 
(7-45b) 


where the constant phase 477r1/)\,, as well as the phases ¢,, have been 
absorbed in the definition of the o’,. 

Restricting attention to only one of the point scatterers, say the one 
with index n = N, the appropriate component of t, is 


12 (wan) = 5 ov (eman) exp (j2afx2) 


5) aXe 2 
exp | =! ae: (=) (: ae v) | (7-46) 
Al 1 Uy Va 


The first exponential term, having a linear phase dependence, introduces 
a simple tilt in the phase front of this component of transmitted light. 
The angle 6 of tilt from the transparency plane may be determined from 


1 Note that the variable z is now used as a film coordinate, whereas previously it 
represented the radar coordinate. 
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the relation 
sin 6 = dfx (7-47) 


where }, is the wavelength of the light. 

Turning to the second exponential factor, we note its close resem- 
blance to the transmittance function of a positive cylindrical lens, centered 
at coordinate x = 2,, 


ti(z) = exp |i me 


where f; is the focal length. Equating (7-46) and (7-48), we find that this 
factor of t. behaves like a positive cylindrical lens, with focal length 


(x ie n)*| (7-48) 


1 Ne Uy 2 
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and with the lens axis located at coordinate 
Us 
t= ioe (7-50) 


In a similar fashion, the Nth component of tz, 
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has an initial exponential factor which introduces a wavefront tilt in the 
opposite direction, i.e., at angle —é@, and a second exponential factor 
which is identical with the transmittance of a negative cylindrical lens, 
again centered at x = (v;/va)ay and again with focal length f; given by 
Kq. (7-49). 

Figure 7-30 illustrates the three components of light transmitted for 
the case of a single point scatterer. The bias transmittance t allows the 
incident optical wave to pass through the transparency, uniformly 
attenuated but otherwise unchanged. The components t and ts” of the 
transmittance may be regarded as generating a pair of “images” of the 
azimuthal dimension of the point scatterer in the following sense: the 
component t2” focuses light to a bright line focus behind the transparency, 
while the component ts” produces a wave that appears to originate from 
a line source in front of the transparency (see Fig. 7-30). 

If a multitude of point scatterers is present at range 71, each generates 
its own pair of real and virtual line foci during the optical step. The 
relative azimuthal positions of the point scatterers determine the relative 
positions of the centers of the lenslike structures on the film, and therefore 
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Figure 7-80 Light transmitted by the line y = y, of the film transparency. 


are preserved in the relative positions of the corresponding line foci. 
Thus an entire image of the azimuthal distribution of scatterers at range 
r, is recreated across appropriate planes in front of and behind the trans- 
parency. We again emphasize that this image is spread in the y dimension, 
since the film exerts no focal power in this direction. 


Construction of the final image 


We ultimately wish to construct an image, not only of the azimuthal 
distribution of scatterers, but also of their distribution in range. The 
range coordinate of a scatterer has been transferred to the y coordinate 
of its azimuthal history on the film. Thus it is necessary to image the y 
variations of film transmittance directly onto the plane of focus of the 
azimuthal signals. This task is complicated by the fact that the focal 
length of the azimuthal variations is a function of range r; [see Eq. (7-49)] 
and therefore depends on the particular y coordinate under consideration. 
To construct the final radar map, it is evidently necessary to image the 
y variations of transmittance onto a tilted plane in which the azimuthal 
foci occur. 

This task can be accomplished with the optical system of Fig. 7-31. 
A positive conical lens (cf. Prob. 5-3) is inserted immediately behind the 
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transparency. The transmittance function of this lens is 


t(x,yi) = exp (-3 sa 2) (7-52) 


and its focal length depends linearly on the range coordinate (or the 
equivalent y coordinate) according to 


Comparison of Eqs. (7-51), (7-52), and (7-53) shows that this lens removes 
the entire tilted plane containing all the virtual line sources to infinity, 
while the y variations of transmittance remain undistorted. The azimuthal 
information is retained through the particular angles at which the infinitely 
distant virtual line sources lie. A cylindrical lens, placed one focal distance 
from the film, creates a virtual image of the y-dimension structure at 
infinity. The x and y images now coincide and must simply be brought 
back from infinity to a real image. A spherical lens, following the cylin- 
drical lens and placed one focal distance from the final observation plane, 
performs this final operation, yielding a real image of the range and 
azimuthal coordinates of the original terrain, both sharply in focus in 
the output plane. 


Final comments 


The utility of optical processing techniques in the formation of terrain maps 
from synthetic-aperture data lies in the extreme simplicity with which 
the optical system performs the rather complex and intricate linear 
transformation required to obtain an image. From a philosophical point 
of view, this simplicity must to a large extent be attributable to the 
identical nature (up to scaling factors) of the laws governing the propaga- 


Cylindrical 
lens Spherical 


‘ lens 


Conical 
lens 


Image 


plane 
/k— ; —+| asic ee 


Film 
plane 


Figure 7-31 Optical system for forming the image. 
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tion of the reflected radar waves on the one hand, and the optical waves 
in the processor on the other hand. As far as the azimuthal information 
is concerned, the processor may be regarded as creating an exact optical 
analog of the radar waves incident on the synthetic array. Once the optical 
analog has been generated, the image formation can be carried out by 
standard optical techniques. 

It should be emphasized that the creation of an optical analog of 
the radar waves has rested on the ability to retain both amplitude and 
phase information on the photographic transparency. The nonzero tem- 
poral frequency f, remaining after synchronous demodulation, and the 
consequent spatial frequency fx on the film, have played parts of the 
utmost importance in achieving this goal. The amplitude and phase 
variations of the azimuthal histories have been recorded as amplitude 
and phase modulations of the spatial carrier of frequency fx, as evident 
from Eq. (7-44). It is directly due to the presence of this carrier that the 
real and virtual line foci of Fig. 7-30 have been deflected at angles @ and 
— 6, respectively, from the transparency axis. Without such deflections, 
the virtual line focus, for example, could not be viewed without inter- 
ference from the directly transmitted light and the light contributing to 
the real line focus. 

The importance of the use of spatial-frequency carriers has already 
been emphasized in the discussion of the Vander Lugt filter and is again 
evident here. As we shall see in the final chapter, it is also largely due to 
the use of such techniques that the subject of wavefront-reconstruction 
imaging, or holography, has gained such new importance. 


PROBLEMS 
7-1 An object has a periodic amplitude transmittance described by 
t(x,y) ae t(x) ol 


where t(x) is shown in Fig. P7-1. This object is placed in the object plane of 
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Figure P7-1 
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Fig. 7-1, and a tiny opaque stop is introduced on the lens axis in the focal plane. 
Sketch the intensity distribution which results in the image plane. 


The so-called central dark ground method for observing phase objects is achieved 
by placing a tiny opaque stop in the back focal plane of the imaging lens to 
block the undiffracted light. Assuming that the phase delay through the object 
is always much less than 1 radian, find the observed image intensity in terms 
of the object phase delay. 


The Schlieren method for observing phase objects is achieved by the introduc- 
tion of a knife edge in the focal plane to block one-half of the diffracted light. 
Thus the transmittance through the focal plane may be written 


is(z,y) = 19(1 + sgn 2) 


(a) Assuming a magnification of unity and neglecting the image inversion, 
show that the image amplitude U; is related to the object amplitude U, by 


1 ) Ss U,' op t 
Ui(ri,yi) = 5 | vic — fl alesis i 
2 SL OI: 
(b) Let the object field be given by 
Uo(%o,Yo) = exp [J b(20,Yo)] 
where ¢ <1. Show that the image intensity can be approximated as 
1 2 < if, i 
Fik2i,43) = — E == | oe) a’ 
4 Ted Pe — 


(c) Find and sketch the image intensity distr ibution when 
( oO 0) 
; ect — 
(oo) x. y Po T xX 


Find an expression for the image intensity observed when the phase-shifting 
dot of the Zernike phase-contrast microscope is also partially absorbing, with 
intensity transmittance equal to a (0 <a <1). 


with ¢ <1. 


A low-contrast intensity distribution 
I(z,y) = Io + Ad(z,y) |Ag| K Io 


exposes a photographic plate, and a negative transparency is made. Assuming 
that J, biases the film in the linear region of the H&D curve, show that when 
the contrast Ag/9, is sufficiently low, the contrast distribution transmitted 
by the transparency is linearly related to the exposing contrast distribution. 


A certain coherent processing system has an input aperture which is 3 cm wide. 
The focal length of the initial transforming lens is 10 cm, and the wavelength 
is 0.6328 micron. With what accuracy must a frequency-plane mask be posi- 
tioned in the focal plane, assuming that the mask has a structure comparable in 
scale size with that of the input spectrum? 
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7-7 


7-8 


7-9 


7-10 


7-11 


Show that the position of the input to a matched filter affects the output only 
through the particular position at which maximum response may be expected. 


A transparency object with complex amplitude t(z,y) is placed immediately 
in front of a converging lens. The object is normally illuminated by a mono- 
chromatic plane wave, and a photographic plate records the intensity distribu- 
tion across the back focal plane. A positive transparency with a y of two is 
produced. The developed transparency is then illuminated normally by a plane 
wave, and the same converging lens is inserted directly behind the transparency. 
What is the relationship between the amplitude transmittance of the original 
object and the intensity distribution observed across the back focal plane of the 
lens in the second step of the process? 


Two input transparencies with amplitude transmittances h(z,y) and g(z,y) 
are placed in front of a converging lens, with their centers located at coordinates 
(z = 0, y = Y/2) and (z = 0, y = —Y/2), asshown in Fig. P7-9. The intensity 
distribution across the back focal plane of the lens is recorded, and a positive 
transparency with a y of 2 is produced. 

The developed transparency is placed in front of the same lens and retrans- 
formed. Show that the light amplitude across the back focal plane of the lens 
contains the crosscorrelation of h and g. Under what condition can the cross- 
correlation be separated from other output components? 
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Figure P7-9 


A phase object with amplitude transmittance t(z,y) = exp [j¢(zx,y)] is present 
in the object plane of a coherent imaging system. In the back focal plane of the 
system, an attenuating plate (of uniform thickness) with intensity transmittance 


t(x,y) = a(xt + 2x?y? + y4) 


is introduced. How is the resulting image intensity related to the object 
phase? 


An amplitude distribution 
U(x,y) = exp (j2xfiz) + exp (j2nf2x) 


is incident on a photographic plate. The modulation transfer function of the 
plate is M(f). A developed positive transparency with a y of four is produced. 
Assuming operation in the linear region of the H&D curve, plot the frequency 
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spectrum of the complex amplitude distribution transmitted by the trans- 
parency. Label the frequencies of the transmitted components in terms of 
fi and fo, and their amplitudes in terms of M(f) at its appropriate arguments. 


7-12 With reference to Fig. P7-12, a “misfocused”’ (incoherent) spatial filtering sys- 
tem (cf. Sec. 7-3) is to be designed such that the first zero of its transfer func- 
tion falls at frequency f, cycles/em. Assuming that the data to be filtered 
are placed at distance 2f in front of a circular lens of diameter L, what ‘‘mis- 
focus distance’”’ A is required in terms of f, L, and f,? What is the numerical 
value of A for f. = 10 cycles/em, f = 10 cm, and L = 2 cm? 
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Figure P7-12 


7-13 The Vander Lugt method is used to synthesize a frequency-plane filter. As 
shown in Fig. P7-13 (top), a “signal’’ transparency with amplitude transmittance 
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Figure P7-13 
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s(z,y) is placed immediately against a converging lens (rather than in the front 
focal plane) and the photographic plate records the intensity in the back 
focal plane. The amplitude transmittance of the developed plate is made 
proportional to exposure, and the resulting transparency is placed in the system 
of Fig. P7-13 (bottom). Assuming that the appropriate portions of the output 
plane are examined in each case, what should the distance d between the input 
plane and the first lens be in order to synthesize: 

(a) A filter with impulse response s(z,y) ? 

(b) A “matched” filter with impulse response s*( —z, —y)? 


7-14 A certain radar transmits a so-called chirp waveform which can be described 
by the equation 
()= Gr) 
s(t) = rect | — } cos { 2zf,¢ + — ? 
T 2r 
This wave is reflected from a stationary target at unknown range from the 
radar. 
(a) How might optical techniques be used to perform a matched-filtering 
operation on the waveform received during an interval T seconds long following 
transmission of the pulse? 
(b) How can the range of the target be deduced from the output of the optical 
processor? 
(c) How might the signals received during a multitude of T-second range sweeps 
be filtered simultaneously by a single processor? 
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8 WAVEFRONT-RECONSTRUCTION 
IMAGING, OR HOLOGRAPHY 


In 1948, Dennis Gabor proposed [Ref. 8-1] a novel two-step, lensless 
imaging process which he called wavefront reconstruction. Gabor recognized 
that when a suitable coherent reference wave is present simultaneously 
with the light diffracted by an object, then information about both the 
amplitude and phase of the diffracted waves can be recorded in spite of 
the fact that photographic film responds only to intensity. He demon- 
strated that from such a recorded interference pattern (which he called 
a hologram, meaning a “total recording’’), an image of the original object 
can ultimately be obtained. 

While this imaging technique—which we now know as holography— 
received only mild interest in its early days, recent improvements of 
technique and technology have vastly extended its applicability. As a 
consequence, holography now plays a role of great importance in the 
modern theory of image formation. 


8-1 HISTORICAL INTRODUCTION 


Gabor was influenced in his initial studies of wavefront reconstruction 
by previous work of W. L. Bragg in x-ray crystallography [e.g., Ref. 8-2], 
but was primarily motivated by possible applications of the newfound 
technique to electron microscopy. Gabor followed his original proposal 
with two more lengthy papers [Refs. 8-3, 8-4], published in 1949 and 1951, 
considering the application of wavefront reconstruction to microscopy. 
While the envisioned application to electron microscopy has, for practical 
reasons, not yet been successful, nonetheless microscopy has remained an 
important motivating force in holography. 

In the 1950s, a number of authors, including G. L. Rogers [Ref. 8-5], 
H. M. A. El-Sum [Ref. 8-6], and A. Lohmann [Ref. 8-7] significantly 


extended the theory and understanding of wavefront-reconstruction 
imaging. 
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It was not, however, until the early 1960s that the modern revolution 
in holography began. Again it was workers at the University of Michigan’s 
Radar Laboratory, in particular E. N. Leith and J. Upatnieks [Ref. 8-8], 
who recognized the similarity of Gabor’s wavefront-reconstruction process 
to the synthetic-aperture-antenna problem and suggested a modification 
of Gabor’s original technique that greatly generalized and improved the 
process. The Michigan workers soon coupled this new development with 
the emerging laser technology in order to perform three-dimensional 
photography [Ref. 8-9]. The quality and realism of the three-dimensional 
images obtained by holography have, to a large extent, been responsible 
for the great popular interest in the wavefront-reconstruction technique. 

The development of new and more powerful techniques of holography 
in the early 1960s was soon followed by a variety of proposed and demon- 
strated applications of the process. Contrary to popular impression, many 
of the most interesting and potentially useful properties of holograms 
were found to be quite independent and separate from the three-dimen- 
sional imaging capability, as we shall see in some detail in Sec. 8-9. 


8-2. THE WAVEFRONT-RECONSTRUCTION PROBLEM 


The particular problem to be addressed here is that of recording, and later 
reconstructing, the amplitude and phase of an optical wave arriving from 
a coherently illuminated object. This problem is, of course, sufficiently 
general to be of interest for electromagnetic waves in all regions of the 
spectrum, as well as for acoustic waves. Our considerations here, however, 
will be restricted to the optical problem. The general approach adopted 
is similar to that introduced by Collier [Ref. 8-10]. 


Recording amplitude and phase 


As indicated above, the wavefront-reconstruction process must consist of 
two distinct operations: a recording or information-storage operation, 
and a final reconstruction operation. For the moment we restrict attention 
to the first of these operations. 

Since the wavefronts of concern are coherent, it is necessary to store 
information about both the amplitude and phase of the waves. However, 
all recording media available respond only to light intensity. It is there- 
fore necessary that the phase information be somehow converted to inten- 
sity variations for recording purposes. A standard technique for accom- 
plishing this task is interferometry; that is, a second coherent wavefront 
of known amplitude and phase is added to the unknown wavefront, as 
shown in Fig. 8-1. The intensity of the sum then depends on both the 
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Figure 8-1 Interferometric recording. 


amplitude and the phase of the original wavefront. Thus, if 


a(z,y) = a(z,y) exp [—j4¢(z,y)] (8-1) 
represents the wavefront to be recorded, and 
A(z,y) = A(x,y) exp [—j¥(2,y)] (8-2) 


represents the “reference” wavefront with which a interferes, the intensity 
of the sum is given by 


5 (x,y) = |A(a,y)|? + la(z,y)|? + 2A (@,y)a(z,y) cos [P(a,y) — ne 
8-3 


While the first two terms of this expression depend only on the intensities 
of the two waves, the third depends on their relative phases. Thus infor- 
mation about both the amplitude and phase of a has been recorded. Such 
a recording of the pattern of interference between two wavefronts may be 
regarded as a hologram. 

At this point we have not specified any detailed character of the 
reference wave A. Properties which the reference should or must satisfy 
will become evident as the discussion progresses. 


The recording medium 


We shall assume here that the recording medium is photographic plate 
or film. While holograms have been recorded by other devices, including 
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photochromic glasses and electronic detectors, film remains the most 
important and widely used recording medium. 

As on several previous occasions (e.g., Secs. 7-5 and 7-7), the photo- 
graphic material will be assumed to provide a linear mapping of intensity 
incident during exposure into amplitude transmittance after develop- 
ment. Thus we assume that the variations of exposure remain within a 
linear region of the t-EZ curve appropriate for that particular film (effects 
of nonlinearities of the t-E curve are briefly considered in Sec. 8-6). In 
addition, it is assumed that the MTF of the emulsion extends to suffi- 
ciently high frequencies to record all of the incident spatial structure 
(effects of removing this assumption are examined in Sec. 8-5). Finally, 
we assume that the intensity |A|? of the reference is uniform across the 
recording surface, in which case the amplitude transmittance of the 
developed film may be written 


(zy) = hb + BY(lal? + A*a + Aa*) (8-4) 


where t, is a uniform “bias” transmittance established by the reference, 
and @’ is the product of the slope 8 of the t-E curve at the bias point and 
the exposure time. Note that again @’ is a negative number for a negative 
transparency, and a positive number for a positive transparency. 


Reconstruction of the original wavefront 


Once amplitude and phase information about the object wave a has been 
recorded, it remains to reconstruct that wave. Suppose that the developed 
transparency is illuminated by a coherent reconstruction wave B(z,y). 
The light transmitted by the transparency is evidently 


B(x,y)t;(v,y) = tsB + B’aa*B + f’A*Ba + ’ABa* 
=U,+U,4+U,4+ U, (8-5) 


Now note that if B is simply an exact duplication of the original reference 
wavefront A, the third term of this equation becomes 


Us(z,y) = B’|Ala(z,y) (8-6) 


Recalling our assumption that the intensity |A|? of the reference is 

uniform, it is clear that U; is, up to a simple multiplicative constant, an 

exact duplication of the original object wavefront a, as shown in Fig. 8-2a. 

In a similar fashion, if B(z,y) happens to be chosen as the conjugate 

of the reference wave, i.e., as A*(x,y), the fourth term of the reconstructed 
field becomes 

U,(2,y) = B’|Al|?a* (8-7) 
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Figure 8-2 Wavefront reconstruction with 
(a) the original reference wave 
A as illumination, and (b) the 
conjugate reference wave A* as 
illumination. 


which is proportional to the conjugate of the original object wavefront. 
This case is shown in Fig. 8-20. 

Note that, in either case, the particular field component of interest 
(that is, U; when B = A and U, when B = A*) is accompanied by three 
additional field components, each of which may be regarded as extraneous 
interference. Evidently if a usable duplication of a (or of a*) is to be 
obtained, some method of separating the various components of trans- 
mitted light is required. 


Linearity of the wavefront-reconstruction process 


The characteristic behavior of the photographic film, as represented 
by Eq. (8-4), corresponds to a highly nonlinear mapping of fields incident 
during exposure into fields transmitted after development. It would 
therefore appear, at first glance, that linear systems concepts can play 
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no role in the theory of wavefront reconstruction. Fortunately, however, 
this is not the case. While the overall mapping introduced by the film is 
nonlinear, nonetheless the mapping of the object field a into the single 
transmitted field component U; is entirely linear, as evidenced by the 
simple proportionality of Eq. (8-6). Similarly, the mapping of a into the 
transmitted field component U,, as represented by Eq. (8-7), is a linear 
one. Thus if the object field a is regarded as an input, and the transmitted 
field component U; (or Us) is regarded as an output, the system so defined 
is a linear one. The nonlinearity of the film manifests itself in the genera- 
tion of a number of output terms, but there is no nonlinear distortion 
of the one output term of interest, assuming the exposure variations 
remain in a linear region of the t-E curve. 


Image formation by wavefront reconstruction 


To this point we have considered only the problem of reconstructing a 
wavefront which arrived at a recording surface from a coherently illumi- 
nated object. It requires but a small change in point of view to regard 
the wavefront-reconstruction process as a means of zmage formation. 

To adopt this point of view, note that the wave component U; of 
Eq. (8-6), being simply a duplication of the original object wave a, must 
appear to an observer to be diverging from the original object, in spite 
of the fact that the object has long since been removed. Thus when the 
reference wave A is used as the illumination during reconstruction, the 
transmitted wave component U; may be regarded as generating a virtual 
image of the original object. This case is illustrated in Fig. 8-3a,b for the 
case of a simple point-source object. 

In a similar fashion, when the conjugate of the reference wave, A*, 
is used as the illumination during reconstruction, the wave component U, 
of Eq. (8-7) also generates an image, but this time it is a real image which 
corresponds to an actual focusing of light in space. To prove this assertion, 
we invoke the linearity property discussed above, considering an object 
which consists solely of a single point source. The corresponding result 
for a more complicated object may be found by linear superposition of 
point-source solutions. 

Incident on the film we have the sum of the reference wave A and 
a simple spherical wave 


a(z,y) = a, exp [jk V2.2 + (@ — 4)? + (y — Go)? (8-8) 


where (4.,f.) are the (x,y) coordinates of the object point, and 2, is its 
normal distance from the recording surface. Illuminating the developed 
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Figure 8-3 Imaging by wavefront reconstruction. (a) Recording the 
hologram of a point-source object; (b) generation of the 
virtual image; (c) generation of the real image. 
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hologram with A*, we obtain the transmitted wave component 


U.(z,y) = B|A)2a*(2,y) 
= p'|A\’a* exp [—jk Vz? + (@ — 4)? + (y — G.)*] (8-9) 


which is a spherical wave that converges toward a real focus at distance 
z. from the hologram, as shown in Fig. 8-3c. A more complicated object 
may be considered as a multitude of point sources of various amplitudes 
and phases; and by the linearity property, each such point source gen- 
erates its own real image as above. Thus a real image of the entire object 
is formed in this fashion. 

Note that the amplitude of the wave described by Eq. (8-9) is pro- 
portional to a¥, the conjugate of the original object point-source amplitude. 
Similarly, for a more complicated object, the real image generated by the 
hologram is always the complex conjugate of the original object. Such a 
change of phase does not affect the image intensity, but it can be impor- 
tant in certain applications which utilize both the amplitude and the 
phase of the image (see Sec. 8-9). 

It should again be emphasized that we have considered, in each case, 
only one of the four wave components transmitted by the hologram. 
This approach is acceptable if, by a proper choice of reference, the unde- 
sired components are suppressed or are separated from the image of 
interest. When this is not the case, the interference of the various com- 
ponents of transmitted light must be taken into account. 


8-3 THE GABOR HOLOGRAM 


Keeping in mind the preceding general discussion, we now consider the 
wavefront-reconstruction process in the form originally proposed and 
demonstrated by Gabor. Later (in Sec. 8-4) we turn to modifications of 
the process which improve its imaging capabilities. 


Origin of the reference wave 


The geometry required for recording a Gabor hologram is illustrated in 
Fig. 8-4. The object is assumed to be highly transmissive, with an 
amplitude transmittance 


t(re;Yo) =t+ At(Xo,Yo) (8-10) 


where t, is a high average level of transmittance, At represents the varia- 
tions about this average, and 


|At| < |to| (8-11) 
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Figure 8-4 Recording a Gabor hologram. 


When such an object is coherently illuminated by the collimated 
source shown in Fig. 8-4, the transmitted light consists of two components: 
(1) a strong, uniform plane wave passed by the term t., and (2) a weak, 
scattered wave generated by the transmittance variations At(2.,y.). The 
intensity of light incident on a photographic plate at distance z, from the 
object may be written 


S(x,y) = |A + a(z,y)|? 
= A? + |a(z,y)|? + Aa(z,y) + Aa*(z,y) (8-12) 
where A is the amplitude of the plane wave, and a(z,y) is the amplitude 
of the scattered light. 

Thus the object has, in a sense, supplied the required reference wave 
itself through the high average transmittance t,. The interference of the 
directly transmitted light with the scattered light results in a pattern of 
intensity which depends on both the amplitude and the phase of the 
scattered wave a(z,y). 


The twin images 


The developed photographic transparency is assumed to have an ampli- 
tude transmittance which is proportional to exposure. Thus 


(zy) = t + B’(la|? + Aa + Aa*) (8-13) 


If the transparency is now illuminated by a normally incident plane wave 
of uniform amplitude B, the resulting transmitted field amplitude con- 
sists of a sum of four terms: 


Bt; = Bh + B’Bla(x,y)|? + B’ABa(a,y) + B’ABa*(a,y) (8-14) 


The first term is a plane wave which passes directly through the trans- 
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parency, suffering a uniform attenuation but without scattering. The 
second term may be dropped as negligible by virtue of our assumption 
(8-11) which implies that 

la(z,y)|<« A (8-15) 


The third term represents a field component which is proportional to the 
original scattered wave a(x,y). This wave appears to originate at a virtual 
image of the original object located at distance z, from the transparency, 
as shown in Fig. 8-5. Similarly, the fourth term is proportional to a*(z,y) 
and, in accord with our earlier discussions, leads to the formation of a real 
image at distance z, on the opposite side of the transparency from the 
virtual image (see Fig. 8-5). 

Thus the Gabor hologram generates simultaneously real and virtual 
images of the object transmittance variations At, both images being 
centered on the hologram axis. These so-called twin images are separated 
by distance 2z,, and are accompanied by the coherent background Bt). 

Note from Eq. (8-14) that positive and negative transparencies yield 
different signs of the image-forming waves with respect to the coherent 
background (§’ is positive for a positive transparency, and negative for 
a negative transparency). Since the image-forming waves interfere with 
the background in the final viewing process, a positive hologram trans- 
parency is found to produce an image with positive contrast, and a 
negative hologram transparency, an image with negative (i.e., reversed) 
contrast. 


Limitations of the Gabor hologram 


The Gabor hologram is found to suffer from certain limitations which 
restrict the extent of its applicability. Perhaps the most important 
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Figure 8-5 Formation of the twin images. 
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limitation is inherent in the assumption (8-11) of a highly transparent 
object, and the consequent assumption (8-15) that followed. If this 
assumption is not adopted, there exists an additional wave component 


U(x,y) oy B’Bla(x,y)|? (8-16) 


transmitted by the hologram which can no longer be dropped as negligible. 
In fact, if the object is of low average transmittance, this particular wave 
component may be the largest transmitted term, and as a consequence 
may entirely obliterate the weaker images. Thus with a Gabor hologram 
it is possible to form wavefront-reconstruction images of an object con- 
sisting, for example, of opaque letters on a transparent background, but 
not of transparent letters on an opaque background. This restriction 
seriously hampers the use of Gabor holograms in many potential 
applications. 

A second serious limitation lies in the generation of twin images, 
rather than one image. The problem lies not with the presence of two 
images per se, but rather with their inseparability. When the real image 
is brought to a focus, it is always accompanied by an out-of-focus virtual 
image. Likewise, an observer focusing on the virtual image sees simul- 
taneously a defocused image arising from the real-image term. Thus even 
for highly transparent objects, the quality of the images is reduced by 
this twin-image problem. A number of techniques have been proposed for 
eliminating or reducing the twin-image problem [e.g., Ref. 8-7], including 
one technique originated by Gabor himself [Ref. 8-11]. The most successful 
of these proposals has been that of Leith and Upatnieks [Ref. 8-8], which 
will be discussed in detail in the section to follow. 


8-4 THE LEITH-UPATNIEKS HOLOGRAM 


The most widely used type of hologram is the so-called Leith-Upatnieks 
hologram, also known as an offset-reference hologram. The major difference 
between this type and the Gabor hologram is that, rather than depending 
on light directly transmitted by the object as a reference, a separate and 
distinct reference wave is introduced during the recording process. 
Furthermore, the reference is introduced at an offset angle, rather than 
being collinear with the object-film axis. 

The first successful demonstration of this type of hologram was 
published by E. N. Leith and J. Upatnieks in 1962 [Ref. 8-8]. It is of some 
historical interest to note that the original demonstration of this technique 
was accomplished without a laser source. However, not until the technique 
was coupled with highly coherent laser illumination did its full potential 
become evident [Refs. 8-9, 8-12]. 
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Figure 8-6 Recording a Leith-Upatnieks hologram. 


Recording the hologram 


One possible geometry for recording a Leith-Upatnieks hologram is 
illustrated in Fig. 8-6. A point source of illumination is collimated by 
the lens L. A portion of the resulting plane wave strikes the object, which 
is taken to be a transparency with a general amplitude transmittance 
t.(.,yo). A second portion of the plane wave strikes the prism P located 
above the object and is deflected downward at angle @ with respect to 
the film axis. Thus at the recording surface we find the sum of two coherent 
waves, one consisting of light transmitted by the object and the second 
consisting of a tilted plane wave. The amplitude distribution across the 
film may be written 


U(z,y) = A exp (—j2ray) + a(z,y) (8-17) 
where the spatial frequency a of the reference wave is given by 
sin 6 


x (8-18) 


a= 


The intensity distribution across the film is evidently 


I (x,y) = Ae == la(x,y)|? 
+ Aa(z,y) exp (j2ray) + Aa*(z,y) exp (—j2ray) (8-19) 


An alternative revealing form may be obtained by writing a explicitly 
as an amplitude and phase distribution 


a(zx,y) = a(x,y) exp [—7¢(x,y)] (8-20) 
and combining the last two terms of (8-19) to yield 
S(x,y) = A? + a(z,y) + 2Aa(z,y) cos [2ray — (x,y)] (8-21) 
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Figure 8-7 Reconstruction of the images. 


This expression demonstrates that the amplitude and phase of the light 
transmitted by the object have been recorded, respectively, as amplitude 
and phase modulations of a spatial carrier of frequency a. If the carrier 
frequency is sufficiently high (just how high it must be we shall see 
shortly), the amplitude and phase distributions can be unambiguously 
recovered from this pattern of interference. 


Obtaining the reconstructed images 


In the usual fashion, the photographic plate is developed to yield a trans- 
parency with amplitude transmittance proportional to exposure. Thus 
the film transmittance may be written 


t(v,y) = te + B’[la|? + Aa exp (72ray) + Aa* exp (—j2ray)] (8-22) 
For convenience we represent the four terms of transmittance by 


tects t; = B’Aa(z,y) exp (j2ray) 


/ 9 t . es 
te = B’la(x,y)|? ti = B’Aa*(x,y) exp (—j2ray) Ged 


For the present we assume that the hologram is illuminated by a 
normally incident, uniform plane wave of amplitude B, as illustrated 
in Fig. 8-7. The field transmitted by the hologram has four distinct 


components, each generated by one of the transmittance terms of 
Kq. (8-23): 


U, = tB U; = p’BAa(a,y) exp (j2ray) 


B'Bla(ey)|? Us = B'BAa*(2,) exp (—j2ray) 74) 


= 
T 
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The field component U, is simply an attenuated version of the incident 
reconstruction field, and thus represents a plane wave traveling down the 
transparency axis. The second term U, is spatially varying and therefore 
has plane-wave components traveling at various angles with the optical 
axis. However, as we shall see in more detail shortly, if the bandwidth 
of a(z,y) is small compared with the carrier frequency a, the energy in 
this wave remains sufficiently close to the transparency axis to be spatially 
separated from the images of interest. 

The wave U; is proportional to the original object wavefront a times 
a linear exponential factor. Proportionality to a implies that this term 
generates a virtual image of the object at distance z, from the transparency, 
while the linear exponential factor exp (j2ray) indicates that this image 
is deflected off the transparency axis at angle 0, as shown in Fig. 8-7. 
Similarly the wave U, is proportional to the conjugate wavefront a*, 
which indicates that a real image is formed at distance z, from the trans- 
parency, on the side opposite from the virtual image. The presence of the 
linear exponential factor exp (—j2ray) indicates that the real image is 
deflected at angle —@ from the transparency axis, as again can be seen 
in Fig. 8-7. 

The most important observation to be derived from these results is, 
of course, that while both real and virtual images are again generated by 
the process, they have been angularly separated from each other and from 
the wave components U,; and U,. This separation comes about due to the 
use of a reference with an angular offset; indeed, successful isolation of the 
real and virtual images requires the use of a reference angle @ which is 
chosen larger than some lower limit Onin (the minimum reference angle 
Omin Will be discussed in more detail shortly). When 6 exceeds Onin, the 
real and virtual images are not contaminated by other wave components. 

Note in addition that, since the images may be viewed independent 
of any coherent background generated by the average transmittance, 
the particular sign associated with the wave components U; and U, of Kq. 
(8-24) is immaterial. Thus the transparency may be either a positive or a 
negative; in each case a “positive” image is obtained. For practical reasons 
it is generally preferable to use negatives directly, thus avoiding the two- 
step process required for making positive transparencies. 

Finally we should point out that we have chosen to illuminate the 
hologram with a normally incident plane wave, which is neither a duplica- 
tion of the original reference wave nor of its complex conjugate, yet we 
have obtained a real and a virtual image simultaneously. Evidently our 
assumptions in Sec. 8-2 concerning the required nature of the reconstruc- 
tion illumination were overly restrictive. However, when the thickness of 
the recording emulsion is comparable with the transverse structure of the 


212 / INTRODUCTION TO FOURIER OPTICS 


Source 
pS 


? 


““On-axis’ 


\ waves 


Virtual Real 
image image 
(a) 
Source Hologram 
——— 
“On-axis” 
waves 
IS 
Be ae Z Real 
f WP image 
Virtual 
image 
(b) 


Figure 8-8 Alternative geometries for reconstructing the images. (a) 
Hologram illuminated by a duplication of the original 
reference wave; (b) hologram illuminated by the conjugate 
of the original reference wave. 


interference pattern incident during exposure, the exact nature of the 
reconstruction illumination becomes more critical. As will be discussed in 
Sec. 8-7, it is then crucial that the hologram be illuminated with a duplica- 
tion of the reference wave to obtain a virtual image, or with the complex 
conjugate of the reference wave to obtain a real image. It is left as an 
exercise for the reader (see Prob. 8-1) to show that the relationship 
between the reconstruction illumination, the hologram, and the images is 


as shown in Fig. 8-8 in the two respective cases. 


WAVEFRONT-RECONSTRUCTION IMAGING, OR HOLOGRAPHY / 213 


The minimum reference angle 


Returning to the reconstruction geometry of Fig. 8-7, if the twin images 
are to be separated from each other and from the light transmitted near 
the transparency axis, the offset angle 6 of the reference must be greater 
than some minimum angle Onin. To find this minimum, it suffices to deter- 
mine the minimum carrier frequency a for which the spatial-frequency 
spectra of ts; and t, (that is, the virtual-image and real-image terms of 
hologram transmittance) do not overlap each other, and do not overlap 
the spectra of t; and ts. If there is no such overlap, then in principle the 
hologram can be Fourier transformed with the aid of a converging lens, 
the unwanted spectral components can be removed by appropriate 
stops in the focal plane, and a second Fourier transformation can be per- 
formed to yield just that portion of the transmitted light which leads to 
one of the twin images. 

Consider the spatial-frequency spectra of the various terms of trans- 
mittance listed in Eq. (8-23). Neglecting the finite extent of the trans- 
parency aperture, we have directly that 


Gilfx,fy) = F{ti(z,y)} = td(fx,fy) (8-25) 


Using the autocorrelation theorem, we also have 


Gilfx,fy) = F{te(x,y)} = B’Gal(fx,fy) * Galfx,fy) (8-26) 
where G.(fx,fy) = F{a(z,y)} (8-27) 


and * indicates the autocorrelation operation. Finally we have 


G3(fx,fy) = F{ts(z,y)} = B’AGa(fx,fy — a) (8-28) 
and G.(fx,fy) = F{ta(z,y)} = B’AGI(—fx,—fy — a) (8-29) 


Now note that the bandwidth of G, is identical with the bandwidth 
of the object, for the two spectra differ only by the transfer function of the 
propagation phenomenon, which is the pure phase function of Eq. (3-47). 
Suppose that the object has no frequency components higher than B 
cycles/mm. Thus the spectrum |G,| might be that shown in Fig. 8-9a. 
The corresponding spectrum of the hologram transmittance is illustrated 
in Fig. 8-9b. The term |G,| is simply a 6 function at the origin of the fxfy 
plane. The term |G,|, being proportional to the autocorrelation function 


1 Spatial-filtering operations are seldom used in practice to separate the twin 
images. If the reference angle satisfies the requirements to be derived here, the images 
will separate of their own accord due to the different directions of propagation of the 
respective wave components (cf. Fig. 8-7). However, the spatial-filtering arguments 
do provide a conceptually simple way of finding sufficient conditions for separation. 
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Figure 8-9 Spectra of the (a) object and (b) hologram. 


of |G,|, extends to frequencies as high as 2B. Finally, |Gs3| is simply 
proportional to |G,|, displaced to a center frequency (0,a), while |Gs| is 
proportional to a reflected version of |G,| centered at frequency (0,—a). 
Examination of Fig. 8-9b shows that |G3| and |G,| can be isolated 
from |G,| if 

a > 3B (8-30) 

or equivalently, if 
sin 6 > 3B (8-31) 


Evidently the minimum allowable reference angle is given by 
hes = Sin" BYE (8-32) 


When the reference wave is much stronger than the object wave, this 
requirement can be relaxed somewhat. The term G; is generated physically 
by interference of light from each object point with light from all other 
object points, while G3 and Gy, arise from interference of the reference and 
object waves. When the object wave is much weaker than the reference 
wave (i.e., when a < A), the term Gy» is of much smaller magnitude than 
Gy, G;, and Gu, and can be dropped as negligible. In this case the minimum 
allowable reference angle is that which barely separates G; and Gy, from 
each other, or 


Osin = Sin” BX (8-33) 


Extension to uncollimated reference and reconstruction waves 


It is, of course, natural to consider more general geometries for recording 
holograms and reconstructing images. Of particular interest are the prop- 
erties of wavefront-reconstruction systems which use spherical reference 
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waves and spherical reconstruction waves.! Referring to Fig. 8-10a, we 
suppose that the reference wave is generated by a point source at coor- 
dinates (2;,y,,2,). Since the mapping of object amplitudes remains 
linear, regardless of the specific form of the reference wave (see Sec. 8-2), 
it suffices? to consider a single object point source at coordinates 
(%o,Yo,20.). AS shown in Fig. 8-10b, during the reconstruction step the 
transparency is illuminated by a spherical wave originating at coordinates 
(Lp,Yn%p). For additional generality, the wavelength \,. during recon- 


1 For more extensive discussions of this material, including consideration of the 
aberrations present, see Refs. 8-13 and 8-14. 

2 Implicit in this simplification is the assumption that the angular offset of the 
reference is sufficient to allow isolation of the real and virtual images. 
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Figure 8-10 Generalized (a) recording and (b) reconstruction 
geometries. 
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struction may differ from the wavelength \; used during the recording 


process. 
Using quadratic approximations to the spherical waves, the total 


field incident on the photographic film may be written 


Ulery) = Aexp {7=[@ — 2)? + — wo] 
+ a exp i [(% — ao)? + (y — 1] (8-34) 


where A and a are complex constants representing the relative amplitudes 
and phases of the two spherical waves. The corresponding intensity 
distribution across the interference pattern is thus 


S(x,y) = |Al? + Jal? 


a Tv 
A * iy a pe ae ae ps. Ne ey 
+ Aa exp [51 te) ee Ue aul Te, 


(et) we] 


+ A*a exp | een [a —2,)? + (y—y,)3] +55 Ie — x)? +(y— uw] 
(8-35) 


If the amplitude transmittance of the developed transparency is 
proportional to exposure, then the important terms of transmittance are 


ts = B’Aa* exp ere ea re) (Ue) | 


TT 


a eal cng -1)3} 


120 


A (8-36) 
t, = p’A*a exp {-35% Ae irene as MC 79 bal 


ete — 7) Gy — w)'] 


The hologram is illuminated by the spherical wave 


Tv 


INcrp=eiiss {3 eee wil (8-37) 


oe p 


The two wavefronts of interest behind the transparency are found by 
multiplying (8-36) and (8-37), yielding 


en eines i= Ce s wl 
ie (8-38) 
Westies [3 [We — a) + y— wil 
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To identify the nature of these transmitted fields, we must examine their 
(x,y) dependence. Since only linear and quadratic terms in x and y are 
present, the two expressions U; and Uy may be regarded as quadratic 
approximations to spherical waves; the presence of linear terms simply 
indicates that the waves are converging toward (or diverging from) points 
which do not lie on the z axis. It remains to determine the exact location 
of these (real or virtual) points of convergence. 

Note that the phase terms which depend on x? and y? determine the 
z coordinate of the foci, while the linear terms determine the off-axis 


coordinates. If the (x? + y?) terms are collected, we find exponentials of 
the form 


eae 1 
exp E (: oe + ) (x? + v)| (8-39) 
Nizy ~~ «Aaz, Ae 
where the top set of signs applies for one wave component (U3) and the 
bottom set for the other (U,). Comparing these forms with that of a 


spherical wave diverging from a point at distance z; to the left of the 
hologram in Fig. 8-106, 


ep [i @ +y)| (8-40) 


we find that the distance z; from the apparent point source to the hologram 


is given by 
1 Nene AN 
n= (222%) (8-41) 
Zp 12, 120 


where again the upper set of signs applies for one wave and the lower set 
for the second wave. When z; is positive, the image is virtual and lies to 
the left of the hologram in Fig. 8-10b; when z; is negative, the image is 
real and lies to the right of the hologram. 

While Eq. (8-41) specifies the z coordinates of the images, the x and 
y coordinates remain to be determined. These coordinates can be found by 
collecting the linear x and y terms of Eq. (8-38) and comparing them with 
the form 


2 @) € 
exp | -/ (ae + vw) | (8-42) 
oei 
The two terms of interest are 


Lo tr Lp — Yo Yr Yp ; 
—j = aE at (= ae = a y 
pay | tid IG \1Zo = \izr * 22.) \1Zo A1ér Nop 


(8-43) 
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Comparing (8-42) and (8-43), we identify 


22 NYA a 
a = $2, + —4,+ — 2 
\1Zo \iér Pp (8-44) 
doz; NYS Zi 
i) + to) a5 Yr ate ae, 
4 we der ee s 


as the off-axis coordinates of the two images. 
Note in particular that a change of object coordinates (Az,,Ayo) results 
in a change of image coordinates 


Thus there is a magnification 


Aa; 
AZo 


M = (8-45) 


=| 


he N22: 
AYo 


r 1Z0 


associated with the wavefront-reconstruction process in this generalized 
geometry! Substitution of (8-41) in (8-45) yields the equivalent expression 


(8-46) 


When a collimated reference wave (z, = ~) and a collimated reconstruc- 
tion wave (2, = ©) are used, the magnification can be seen to be unity, 
regardless of the wavelength ratio \i/A2.! A magnification of unity is also 
obtained for the virtual image when Az = ); and zy = z,, and for the real 
image when A» = A; and z, = —z2, (see Prob. 8-3). 


Extension to three-dimensional photography 


In 1964, Leith and Upatnieks reported the first successful extension of 
holography to three-dimensional imagery [Ref. 8-9]. Success in this 
endeavor rested to a large degree on the availability of the HeNe laser, 
with its excellent temporal and spatial coherence. 

Figure 8-1la illustrates the general geometry used for recording 
holograms of three-dimensional scenes. Coherent light illuminates the 
scene of interest. In addition, a portion of the illumination strikes a 
“reference” mirror which is placed next to the scene. Light is reflected 
from the mirror directly to the photographic plate, where it serves as a 
reference wave, interfering with the light reflected from the scene itself. 


1 The axial image distance z; does depend on )/d2, however. 
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Figure 8-11 Holographic imaging of a three-dimensional scene. 
(a) Recording the hologram; (b) reconstructing the 
images. 


Thus the photographic plate records a hologram of the three-dimensional 
scene. 

To reconstruct images, the developed plate is illuminated by a 
reconstruction wave, as shown in Fig. 8-116. The virtual image appears 
behind the plate, fixed in three-dimensional space. Since the wavefronts 
originally incident on the plate have been duplicated during reconstruc- 
tion, the image retains all three-dimensional properties of the original 
object. In particular, parallax effects are readily noticed, the observer 
being able to “look behind” objects in the foreground simply by changing 
his viewing position. 

The real image generated by the process forms in space between the 
photographic plate and the observer. For three-dimensional objects, this 
image has certain properties which make it of less utility than the virtual 
image. First, points on the object which were closest to the photographic 
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Figure 8-12 Experimental apparatus for recording a hologram of a three- 
dimensional scene. S, laser source; M,, mirror; BS, beam splitter; M 2, 
mirror; O, object scene; R, reference mirror; P, photographic plate. 


plate (and therefore closest to an observer of the original scene) appear 
in the reconstructed real image again closest to the photographic plate, 
which in this case is farthest from the observer (cf. Fig. 8-11b). Thus to an 
observer of the real image, the parallax relations are not those associated 
with the original object, and the image appears (in a certain peculiar 
sense which must be personally observed to be fully appreciated) to be 
‘Snside out.”’ Images of this type are known as pseudoscopic. 

As a second disadvantage, if photographic film is inserted in an 
attempt to record the real image directly, the experimenter soon discovers 
that (for holograms of reasonable size) the depth of focus is generally 
so small that a recognizable recording cannot be obtained. This problem 
can be alleviated by illuminating only a small portion of the hologram, 
in which case the depth of focus is increased and a usable two-dimensional 
image can be recorded. If the illuminating spot on the hologram is moved, 
then the apparent perspective of the two-dimensional image changes. 
Thus every small region of a large hologram is capable of producing a real 
image of the original object with a different perspective! 

Figure 8-12 shows an experimental apparatus used for recording 
holograms of three-dimensional scenes. Light from the laser S is sent, via 
the mirror M,, to a beam splitter BS. A portion of the light passes through 
the beam splitter and is directed by the mirror M2 to the three-dimensional 
scene, which in this case consists of two small figurines in front of a sign. 
A second portion of the light is reflected by the beam splitter BS to 
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illuminate the reference mirror R, in which there happens to be seen a 
reflected image of one of the figurines. Light from the reference mirror 
and from the scene interferes on the photographic plate P. 

Figure 8-13 is a photograph of the developed hologram itself. Note 
that there is no recorded structure that can be identified with any par- 
ticular part of the scene. In fact, most of the observable structure can be 
said to be irrelevant to the reconstruction in the sense that it arises from 
imperfections in the optical apparatus (e.g., from dust particles on mirrors). 
The structure which actually generates the reconstructed image is far too 
fine to be resolved in this photograph. 

Figure 8-14 illustrates the manner in which the virtual image may be 
viewed. The laser is in this case not in the picture, and only the mirror M 
which deflects the beam can be seen. Behind the illuminated hologram a 
virtual image of the horse figurine is seen. 

To illustrate the truly three-dimensional character of the virtual 
image, we refer to Fig. 8-15, which shows two photographs of the virtual 
image. In Fig. 8-15a, the camera is focused on the background of the 
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Figure 8-14. Experimental apparatus for viewing the virtual image. 


virtual image; the sign in the background is sharply in focus, while the 
figurines in the foreground are out of focus. Note also that the tail of the 
horse obscures the head of the shadow of the dog. The camera is next 
refocused on the foreground and moved to change the perspective. Figure 
8-15b shows the image obtained, with the foreground now in focus and the 
background out of focus. The tail of the horse no longer obscures the head 
of the shadow of the dog, a consequence of the change of perspective. 
Thus the camera has succeeded in “looking behind” the tail by means of a 
simple lateral movement. 


Practical problems 


Certainly the most prominent practical problem since the inception of 
holography has been that of obtaining a source of adequately high coher- 
ence and intensity. With the advent of the laser, this problem has been 
greatly alleviated. However, the availability of laser sources has led to 
new and more sophisticated applications of holography, such as the three- 
dimensional photography described above. Many of these new applica- 
tions tax the capabilities of relatively low power CW lasers, and the source 
technology remains a major constraint on new applications of holography. 
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Figure 8-15 Photographs showing the three-dimensional character of the virtual 
image. 
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The process of recording a hologram is little more than an exercise 
in interferometry. As with any interferometric experiment, if clean and 
sharp interference fringes are to be recorded, it is essential that the optical 
apparatus be stable to within a fraction of a wavelength during the 
exposure period. The higher the intensity of the source, the shorter the 
required exposure period, and the less severe the stability requirements. 
The exposure time required in any particular case depends on a variety 
of factors, including the transmissivity or reflectivity of the object, the 
distances and geometry involved, and the particular film used. Exposure 
times as short as a tenth of a second and as long as several hours are not 
uncommon. In some cases a pulsed laser may be used as the source, with 
exposure times as short as a few nanoseconds. 

Some of the most stringent experimental requirements are associated 
with the holography of three-dimensional scenes. Films of extremely high 
resolution are required to record holograms of such objects (see Sec. 8-5 
for further discussion of film-resolution requirements). The most com- 
monly used film for this task is Kodak Spectroscopic Plate Type 649F, 
with a resolution capability better than 2,000 lines/mm. Unfortunately, 
high-resolution emulsions are notoriously insensitive (the equivalent ASA 
rating of 649F plate is about 0.03, as compared with Tri-X with a rating 
of 400). Low sensitivity leads, of course, to longer exposure times and 
rather severe requirements on stability. 

An additional practical problem of significance in some cases is the 
limited dynamic range of photographic recording media. With reference 
to Fig. 7-11, the amplitude transmittance is linearly proportional to 
exposure over only a limited range. It is, of course, desirable to choose the 
average exposure to fall at about the midpoint of the linear region. How- 
ever, when the object is, for example, a transparency with rather coarse 
structure, there may exist significant areas on the hologram with exposures 
well above and well below the linear region of the t-E curve. As a conse- 
quence of this nonlinearity, we might expect some distortion of the image 
contrast. For a more detailed discussion of the effects of nonlinearities, 
see Sec. 8-6. 

The dynamic-range problem can to a large degree be overcome by a 
technique first demonstrated by Leith and Upatnieks [Ref. 8-9]. The 
object is illuminated through a diffuser, as shown in Fig. 8-16. Since the 
diffuser scatters light over a wide range of angles, the light from each small 
area on the object is spread over the entire photographic plate, thus 
eliminating areas of very strong and very weak exposure which would 
otherwise be obtained with a directly illuminated object. As might be 
expected, the virtual image obtained in this case appears to be back- 
lighted with diffuse illumination. 
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Figure 8-16 The use of diffuse illumination for recording 
a hologram. 


8-5 EFFECTS OF THE FILM MTF 


To this point it has been tacitly assumed that the MTF of the photo- 
graphic emulsion used for recording the hologram is unity for all spatial 
frequencies of the incident interference pattern. Attention is now turned 
to considering the effects of film frequency response on the quality of 
the reconstructed images. We present analyses for two particularly 
important recording geometries. For further discussions of this subject, 
including an analysis applicable to a more general geometry, the reader is 
referred to the work of R. F. van Ligten [Refs. 8-15, 8-16]. 


Collimated reference wave 


We examine first the effects of the film MTF on the mapping of object 
amplitudes into real-image amplitudes when a plane reference wave is 
used. Similar results may be applied to any real image formed from the 
virtual image by means of an auxiliary system of lenses. 

As discussed previously, if the exposure variations remain within a 
linear region of the t-E’ curve, the object-image amplitude mapping is a 
linear one (see Sec. 8-2). This linearity is not destroyed by the linear 
processes which reduce the frequency response of the film. Therefore it 
suffices to find the effects of the MTF on one general frequency com- 
ponent of the object, and to construct the more general result by linear 
superposition. 

To this end, consider the reference and object waves to be given by 


U,(x,y) = A exp (—j2ray) Be 
U.(z,y) = a exp [—Jj2r(fxx + fry)] (8-47) 
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Figure 8-17 Effects of the film MTF: collimated reference wave. 


respectively. Thus the object and reference waves which arrive at the 
emulsion are plane waves propagating in different directions. The distri- 
bution of exposing intensity is therefore 


S(x,y) = A? + jal? + 2Alal cos {2r[fxx + (fy — ay] + o} (8-48) 


where ¢ = arg a. If we represent the frequency response of the emulsion 
by M(fx,fy); where 


M(fx,fy) = M(fx,fr) exp [—jQ(fx,Fy)] (8-49) 
then, from Eq. (7-15), the effective exposing intensity distribution is 


Set(x,y) = A? + lal? + 2Ala|M(fx,fy — a) 
cos {2n[fxx + (fy — «)y] — Q(fx,fy — a) + o} (8-50) 


We conclude that the particular Fourier component of the object with 
spatial frequency (fx,fy) is attenuated by the factor M(fx,fy — a) and 
phase shifted by Q(fx,fy — «), relative to the amplitude and phase 
obtained with a perfect emulsion (i.e., an emulsion with M = 1 for all 
frequencies). 

Figure 8-17 illustrates this result pictorially. The spectrum of the 
object is assumed to be centered at frequency (fx = 0, fy = 0). The 
MTF of the emulsion is erected about the frequency (fx = 0, fy = a); 
the product of the object spectrum and the film MTF yields the image 
spectrum. 

From the preceding analysis, it is clear that, when a collimated 
reference wave is used, the effect of the film frequency response may be 
represented by a simple multiplicative factor M(fx,fy — a) applied to 
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the object spectrum. This result directly implies that a coherent transfer 
function 


H(fx,fy) = M(fx,fy — a) 


might be associated with the imaging process. However, the spatial fre- 
quency response of the system is limited not only by the film MTF, but 
also by diffraction effects that arise due to the finite aperture of the film 
transparency. Thus, if P(x,y) represents the pupil function of the trans- 
parency, i.e., if 


1 (x,y) in the film aperture 
P = : : 
(259) 0 otherwise 


then the complete coherent transfer function may be written 
H(fx,fy) = P(Adifx,Adify)M(fx,fy — a) (8-51) 


where the first factor P appears by direct analogy with Eq. (6-19). Note 
that the effect of the film MTF may be regarded as entirely equivalent to 
that of inserting an attenuating and phase-shifting mask with amplitude 
transmittance 


ny 
zy) = ) = 8-52 
H(z,y) M (= dd; «) (oe 


across the entrance pupil of the wavefront-reconstruction imaging system. 


The lensless Fourier transform hologram 


We consider next an alternative recording geometry for which the effects 
of the film MTF are markedly different than in the preceding case. The 
recording geometry to be considered here is shown in Fig. 8-18; the refer- 
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Figure 8-18 Recording a lensless Fourier transform 
hologram. 
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ence is now aspherical wave, diverging from a point which is coplanar with 
the object. For reasons which will soon be evident, a hologram recorded 
in this particular geometry may be called a lensless Fourier transform 
hologram [Ref. 8-17]. 

To investigate the properties of this type of hologram, we again 
invoke the linearity of the imaging process, but this time considering the 
response of the imaging system to a single point-source object, rather than 
a plane-wave object. Representing the coordinates of the reference and 
object point sources by (z,,y,) and (x.,yo), respectively, the corresponding 
amplitude distributions across the emulsion may be written (in a quad- 
ratic approximation) as 


U,(x,y) =A exp f [(x 27 LA = (y aa v3] 


de 
Use) = oes {JZ [ee — 2)? + — wo) 4] 


The intensity incident during exposure is thus 


ae 
dz 


clit 2p eo == aCenttito) (8-53) 


S(x,y) = A? + lal? + 2A]al| cos | 2 


where 0(#;,Yr}%0o,Yo) 18 a phase angle which depends on (z,,y,) and (2Xo,Yo) 
but not on the film coordinates (x,y). 

The reason for the name lensless Fourier-transform hologram is now 
evident. The wave generated by the object point at coordinates (2x.,Yo) 
interferes with the reference wave to produce a sinusoidal fringe pattern 
with spatial frequencies 


Xo oo Lr 
hz 
Yo — Yr 
dz 


fx = 
(8-54) 


fy 


Thus for this particular recording geometry there is a one-to-one corre- 
spondence between object coordinates and spatial frequencies on the holo- 
gram. Such a mapping is characteristic of a Fourier transform operation, 
but it has been achieved without the use of a transforming lens. 

To obtain an image from this hologram, the transparency is illumi- 
nated and followed by a positive lens, as shown in Fig. 8-19. As evidenced 
by Eq. (8-41) with zp = © and z = 2z,, the twin images generated by the 
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Figure 8-19 Obtaining images from the lensless Fourier transform 
hologram. 


hologram itself both lie at infinite distance from the transparency. The 
positive lens brings these infinitely distant images to a real focus at one 
focal distance f behind the lens. 

To find the effects of the film MTF on the reconstructed images, we 
combine Eqs. (7-15) and (8-53), yielding an effective intensity distribution 


Ge(z,y) = A? + lal? + 2AlalM (" Er oes u) 
dz dz 


Lo — Lr a= Yr 
pone 
dz 


cos | 2 


+ 8(rYr3t0Yo) — 2 (= | | (8-55) 
dz dz 

The phase shift 2 introduced by the film MTF will affect only the phase 
of the light at the image point corresponding to (a,,y.), and therefore it is 
of no concern. However, the factor M corresponds to an attenuation of 
the light which contributes to the image of the point (z.,y,). Since object 
points farthest from the reference point generate fringes with the highest 
spatial frequencies on the hologram, the images of such points are attenu- 
ated most severely. 

Thus for the lensless Fourier transform recording geometry, the 
limited frequency response of the film has the effect of restricting the 
field of view about the reference point, but does not influence the resolution 
obtained within this field of view. If finax represents, loosely speaking, the 
maximum spatial frequency that can be resolved by the emulsion, then 
only those object points with coordinates satisfying 


V (Go — Br)? + (Yo — Yr)? < defnex (8-56) 


will actually appear in the image. While for a collimated reference wave 
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the effects of finite film resolution have been seen to be equivalent to those 
of a mask placed across the entrance pupil [see Eq. (8-52)], for this second 
recording geometry they are equivalent to those of a mask placed across 
the object itself. The transmittance of the equivalent mask is in this case 
given by 


Lo — Lr Yo Le Yr 
= ) 8-57 
t(Xo,Yo) M ( xe v; ) ( ) 


Since each object point is weighted by this imaging system in a 
manner which depends on its particular coordinates, the system is not 
space-invariant, and strictly speaking, there exists no coherent transfer 
function which can be associated with it. However, from a practical point 
of view, it is possible to divide the object space into so-called zsoplanatic 
regions which are sufficiently small to ensure that Eq. (8-57) is approxi- 
mately constant over any one region, and to specify a coherent transfer 
function appropriate for each such region. In such a case, the coherent 
transfer function may be written 


H(fx,fy) = MP(Adifx,ddify) (8-58) 


where M is the particular value of Eq. (8-57) appropriate for the isoplana- 
tic region in question, and P is again the pupil function that describes the 
effective pupil of the system. 


More general recording geometries 


A complete analysis of the more general recording geometry illustrated 
previously in Fig. 8-9 has been carried out by van Ligten [Ref. 8-16], but 
is too lengthy to be duplicated here. The analysis demonstrates that the 
effects of the film MTF are in all cases equivalent to those of a mask, 
again with amplitude transmittance proportional to a scaled version of M, 
placed at a certain position between the object and the film. The particular 
position of the equivalent mask depends on the recording geometry used. 
The two geometries examined in detail in this section simply represent 
limiting cases for which the location of the effective mask is against the 
entrance pupil on the one hand, and against the object on the other hand. 

For further consideration of the effects of the film MTF, the reader 
may consult the references cited; additional material is also presented in 


Prob. 8-5. 
8-6 EFFECTS OF FILM NONLINEARITIES 


Throughout our discussions of holography we have repeatedly assumed 
that the recording medium is exposed in such a way as to assure operation 
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within a linear region of the amplitude transmittance vs. exposure curve. 
However, real photographic films and plates are never perfectly linear in 
this respect, the deviation from linearity depending to a large degree on 
the magnitude of the variations of exposure to which the medium is sub- 
jected. We present in this section a brief discussion of the effects of film 
nonlinearities on the reconstructed image. It should be emphasized that, 
when the average exposure produced by the object is comparable with 
that produced by the reference, nonlinear effects present perhaps the most 
serious limitation to image quality. This case may be contrasted with that 
of a very weak object, for which film-grain noise is generally the limiting 
factor [Ref. 8-18]. 


The Kozma model 


A useful model for analyzing the effects of film nonlinearities on the wave- 
fronts reconstructed by holography has been discussed by Kozma [Ref. 
8-19]. The Kozma approach treats the photographic emulsion much as a 
nonlinear system element is treated in communication theory problems 
[Refs. 8-20, 8-21]. The input to this element is taken to be the pattern 
of exposure to which the emulsion is subjected. The output is the pattern 
of amplitude transmittance of the developed transparency. 

For simplicity, we assume a plane reference wave, writing the refer- 
ence and object field distributions, respectively, as 


U,(z,y) = A exp (—j2ray) 
U.(x,y) = a(x,y) exp [—j¢(z,y)] 


The total exposure may accordingly be written 


E(z,y) = |U,-(x,y) + U,(z,y) |? 
= AT + a%(x,y)T + 2Aa(z,y)T cos [2ray — o(z,y)] (8-60) 


(8-59) 


where 7’ is the exposure time. We may regard this exposure distribution 
as consisting of a relatively uniform exposure H, = A*T7’ contributed by 
the reference, plus variations £;(v,y) about H,, where 


Ex(x,y) = a(2,y)T + 2Aa(zx,y)T cos [2ray — $(x,y)] (8-61) 


The amplitude transmittance ¢(x,y) of the developed transparency 
may be represented by a relatively constant “‘bias” transmittance t, intro- 
duced by the reference, plus variations of transmittance t:(7,y). Thus 


t(a,y) = tb + ti(x,y) (8-62) 


It is the functional dependence of t; on ZH; [that is, the function ¢:(/1)] 
which determines the linearity of the process. For later convenience, we 
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Figure 8-20 Film nonlinearities. (a) 
The t-E curve; (b) the 
dependence of t,; on Ei; 
(c) the function g(F,). 


prefer to represent this functional dependence by g(/1), where 
gi) = —thy) (8-63) 


Figure 8-20 illustrates typical dependences of ¢ on E#, ¢; on Hj, and finally 


g on Fj. 
The Kozma model is now shown in detail in Fig. 8-21. The constant 
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exposure EF, is subtracted from H(x,y), and the remainder E,(x,y) is 
applied to the nonlinear element with characteristic g(H1). The output of 
this element is subtracted from the constant transmittance to yield the 
total amplitude transmittance ¢(x,y). It remains, of course, to find the 
effect of the nonlinear transformation g(#,;) on the fields transmitted by 
the transparency. 


The transform method of analysis 


One possible method for analyzing the Kozma model is the so-called 
transform method of communication theory [Refs. 8-20, 8-21]. This method 
is based on a purely mathematical device. Let the Fourier transform of the 
nonlinear characteristic g(£1) be defined by 


GuyS jhe g(E1) exp (—j2xfE) dEy (8-64) 


This transform has no real physical significance, for it is not a “‘spectrum”’ 
of a time or space function in the usual sense of the word. Nonetheless, 
its usefulness becomes evident when g(#1) is expressed in terms of its 
transform G(f), 


g(B) = [7 , GUA) exp (72x) df 
= [7 GU) exp [2afor%(e,y) 7) 
exp {j4afAa(z,y)T cos [2ray — $(x,y)]} of 


The second exponential factor may be expanded with the Jacobi-Anger 
formula [Ref. 8-22], 


eo 


exp (jy cos 6) = )) 6j’J,(y) cos v8 (8-65) 


v=0 


where ¢, is the Neumann factor 


1 p= 0 
ae y > 0 ( ) 


Figure 8-21 The Kozma model. 
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and J, is a Bessel function of the first kind, order ». Thus 


oO 


g(a) = Y ed [ [7, GU) exp (j2afa?T)J.4nT Aap) df | cos 2rvay — v6) 
0 


v= 


and finally, 
t(x,y) =t —  H,(a(z,y)) cos [2rvay — vp(a,y)] (8-67) 
v=0 


where 
Hazy) = ed” [7, GU) exp (2xfe°T)J,(4rTAaf) df (8-68) 


The particular form of the result (8-67) is very revealing. The term 
for y = 1 is the component which leads to the usual real and virtual 
images. The phase modulation ¢(z,y) of this component is identical with 
the phase modulation of the incident exposure distribution and therefore 
has not been affected by the nonlinearity. However, the amplitude modu- 
lation has been distorted in accordance with the nonlinear function 
H,(a(x,y)), which in principle can be evaluated for any particular non- 
linearity g(E£,). : 

In addition, harmonics of the spatial carrier a have been generated. 
The carrier frequency of the yth harmonic is va, which implies that the 
wave components generated by these harmonics are traveling at ever- 
steeper angles with respect to the undiffracted light. The phase modulation 
of the vth harmonic is simply » times the incident phase modulation, and 
the amplitude modulation is the nonlinear function H,(a(x,y)) of the 
incident amplitude modulation a(z,y). These harmonics may be regarded 
as generating “‘higher order’ images of the original object, although such 
images will in general be highly distorted due to the incorrect amplitude 
and phase distributions associated with the corresponding wave compo- 
nents generated by the transparency. 

While the general form of the transmittance of the nonlinear trans- 
parency has been found, there exist two additional problem areas which 
we have not covered: (1) the evaluation of the functions H,(a) for some 
reasonable model of the film nonlinearity g(£), and (2) the determination 
of effects of these distortions on the images themselves, rather than just 
the effects on the transmitted wave components. We shall treat neither of 
these problems in detail here. However, special cases of both will be con- 
sidered in the following discussion of a “hard-limiting”’ film. In addition, 
the reader may consult the work of Kozma [Ref. 8-19], in which the so- 
called error-function limiter model is used to determine the functions H,. 
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Figure 8-22 Amplitude transmittance- 
vs.-exposure curve for an 
ideal hard-limiting film. 


The direct method of analysis: the hard limiter 


While the transform method of analysis is attractive because of its gener- 
ality, in some cases a more direct approach can yield simple results with 
relative ease. An example of this direct approach will be presented here 
for the case of a hard-limiting film. By this we mean a film with the trans- 
mittance-exposure curve illustrated in Fig. 8-22. For all exposures below 
a certain threshold Hy the amplitude transmittance remains unity, while 
for all exposures above E>» the transmittance drops to zero. Such a model 
is at best a rough approximation to a very high gamma film. It is perhaps 
a better approximation to the nonlinear characteristic encountered when 
a hologram is scanned, analog-to-digital-converted, and quantized to two 
levels electronically. 

To analyze this model we make two major assumptions. First we 
assume that the ‘‘carrier frequency” @ in Eq. (8-60) ismuch larger than the 
spatial bandwidths of a(x,y) and ¢(z,y). If this is the case, then in any 
region which is small compared with the reciprocal bandwidths of a(x,y) 
and ¢(x,y), the exposure may be treated as the sum of an approximately 
constant bias H, + a®7, plus sinusoidal fringes of spatial frequency a 
with approximately constant amplitude 2Aa7 and phase ¢. Second, we 
assume that the reference exposure H, is chosen to exactly equal the 
threshold exposure Hy. The transformation of exposure variations into 
transmittance variations may then be described by 


1 when a27 + 2AaT cos (2ray — ¢) < 0 
b= .t4 when a27 + 2AaT cos (2ray — ¢) = 0 
0 otherwise 
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Figure 8-23 The nonlinear transformation of (a) the exposure variations into 
(b) the transmittance variations. 


Figure 8-23 illustrates the nonlinear action of the film and defines the 
important quantities of concern. We represent the period of the periodic 
transmittance function ¢(y) by L = a7, and the width of the unit-ampli- 
tude transmittance pulse by /. The displacement of the center of the pulse 
from the y origin is ¢L/27. Note that the amplitude modulation a(z,y) of 
the incident fringe pattern is not entirely lost by the limiting action, for it 
affects the width / of the transmittance pulses. Thus amplitude informa- 
tion is retained, undoubtedly in a distorted form, through a pulse-width 
modulation of the transmittance. Similarly the phase modulation $(,y) 
affects the position of the transmittance pulses with respect to the space 
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origin. Thus phase information is retained in the form of a pulse-position 
modulation. 

Before determining the effects of the nonlinearity on the various wave 
components transmitted by the hologram, we first determine the exact 
relationship between the pulse width / and the amplitude a of the incident 
object waves. This relationship can be found with the help of Fig. 8-24. 
First note that the phase angle 6 defined on part (a) of the figure is 
given by 


6=—- (8-69) 


In addition, with the help of the phasor diagram in part (b), it follows 
that @ is given by 


aT a 
— 0 se Sea loca 8-70 
6 = cos Ga cos (=) ( ) 
Hence, combining (8-69) and (8-70), 
L a 
ee oe 8-71 
l ie cos (=) ( ) 
Exposure 


aT 

ie l 6 

| at Fras igs 4 
ct EES SS a Se ee 


(a) 


| 
| 
| 
| 
| 
| 
| 


Sent sd al 
—< 2AaT 


(b) 


Figure 8-24 Determination of the pulse width l. 
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This result illustrates the nonlinear manner in which the amplitude modu- 
lation a(x,y) is converted to a pulse-width modulation /(x,y). 

To determine the amplitude and phase modulation of the various 
wave components transmitted by the hologram, we adopt a “narrowband” 
approach, Fourier analyzing the transmittance t(z,y) as if it were per- 
fectly periodic in y and independent of #, and then allowing the functions 
a and ¢ to vary slowly in the solution. Such an approach is an accurate 
one if the first of our two assumptions outlined above is satisfied. The 
periodic function t(z,y) = t(y) is thus expanded in the Fourier series 


ty = Y Ce (57) (8-72) 


yp=—o 


where 
i Qrvy 
athe us t(y) exp( —Jj “fa dy (8-73) 


With a change of variables y’ = y — (oL/2z), the transmittance pulse is 
centered on the origin of the y’ coordinate system, yielding 


(f, = meer) le (i me) ay (8-74) 


1/2 


Performing the integration we obtain 


i i) 
is v 

Cc, = 1 ; (8-75) 
* sin (x0) exp (—7v¢) vy #0 
Ty DL 


Combining this result with Eqs. (8-71) and (8-72), we find! 


(Gy) = feos (% “)+ Pye eit E COs; (=)| cos (Qrvay — vd) 


(8-76) 


where it is now understood that a and ¢ are slowly varying functions of 
(x,y). If this result is compared with the previous expression, Eq. (8-67), 
derived by the transform method, it is clear that the functions H, for the 


1 The simplicity of this result, and the direct method by which it may be obtained, 
were first pointed out to the author by his student, Gordon Knight. 
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Figure 8-25 Distortion of the amplitude-modulation 
(a) first-order component and (6) 
second-order component. 


hard limiter are given by 


1 1 a 

5 ze (2) i == (0) 

_ S sin |» cos’! Gal va) 
Tv QA 


The Fourier component of primary interest is that which generates 
the first-order images, i.e., the »y = 1 component. Noting that 


sin cos (=) = 1 — (=) (8-78) 


we can readily see that the corresponding component of transmittance is 


eye (8-77) 


t(x,y) = aM tle Beal cos [2ray — $(z,y)] (8-79) 
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Thus the amplitude modulation of the carrier is distorted in the manner 
illustrated in Fig. 8-25a, while the phase modulation remains undistorted. 
Note that when a < 2A, all amplitude modulation is removed, and the 
film acts in a fashion exactly analogous to the bandpass limiter of com- 
munication theory [Ref. 8-20, p. 288]. However, when a is comparable 
with 2A, the changing bias a?7' introduced by the object wave results in 
the presence of a distorted amplitude modulation of fi. 

As a final note of interest, we consider the component of transmit- 
tance which generates the second-order images, 


hing) = ai E cos} (=) cos [21r(2a)y — 2¢] (8-80) 


= 


f(s) lor) 
nh = (=) (8-81) 


1 2 
to(a,y) = SE. . (=) cos [2r(2a)y — 24] (8-82) 


The distortion of the amplitude modulation for this wave component is 
illustrated in Fig. 8-25b. Note that for small (a/2A), the amplitude 
modulation is undistorted! However, the phase modulation is twice what 
it should be, and, as a consequence, distortion of the second-order image 
can be expected. The reader is referred to Prob. 8-7 for an illustration of 
a technique by means of which both the correct amplitude modulation and 
the correct phase modulation can be recovered when a « 2A. 


Using the relation 


a 
ia |) 22 GORA! === 
8 | c | 


we find 


Effects of nonlinearities on the reconstructed images 


Determination of the effects of nonlinearities on the reconstructed images 
themselves is in general a difficult problem, and one that we shall not 
treat in detail here. However, a few simple observations are in order. 
First, for a hard-limiting film, exposed such that a <« 2A, many of 
the known characteristics of the bandpass limiter of communication 
theory [Ref. 8-23] may be used to predict the effects on the first-order 
image. This approach has been investigated by Friesem and Zelenka 
[Ref. 8-24], who demonstrated both analytically and experimentally 
several important effects. If the object consists of two point sources, one 
of greater amplitude than the other, small-signal suppression effects are 
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anticipated and observed, i.e., the amplitude of the image of the weaker 
point source is suppressed relative to that of the stronger point source. 
In addition, owing to intermodulation effects, false images may be gen- 
erated by the interaction of the two point sources, yielding apparent 
images of point sources that are not actually present on the object. 

The effects of film nonlinearities when the object is diffusely illu- 
minated have also been investigated [Refs. 8-25, 8-26]. In this case the 
exposure is most appropriately treated as a random process, employing 
techniques of analysis which are beyond the scope of our discussions bere. 
For further details the reader may consult the references cited. 

Finally we present photographs which illustrate the nature of the 
image degradations that are encountered in practice. In Fig. 8-26a we 
see a sketch of the geometry used to record the particular holograms of 
interest here. The object is simply a square aperture in an opaque screen, 
backlighted through a diffuser. The reference emanates from a point 
coplanar with the object; thus the lensless Fourier transform geometry 
is used. The hologram is recorded on Kodak Spectroscopic Plate type 
649F. The ratio of reference intensity to object intensity at the recording 
plane was in all cases 1.4 to 1. In Fig. 8-266 we see the two first-order 
images (plus the zero-order term) obtained when the average exposure 
is chosen to produce a bias amplitude transmittance of 0.5. This bias 
affords the maximum linear dynamic range on the ¢-K curve, and non- 
linear effects are accordingly quite negligible here. Figure 8-26c shows 
the images obtained when the exposure time is increased to produce a 
bias transmittance of 0.24, other factors (including the reference-object 
ratio) remaining unchanged. The ¢-F curve is driven over a highly non- 
linear range in this case, and the results are quite evident. First, the 
presence of second-order terms is readily observed. More important, the 
first-order images now appear to consist of a crisp image of the object 
superimposed on a broad background glow. This background is directly 
attributable to the intermodulation distortion generated by the film 
nonlinearities [Ref. 8-26]. 


8-7 EFFECTS OF EMULSION THICKNESS 


It has been assumed in the previous discussions that the photographic 
emulsion has negligible thickness and that the recording of a hologram is 
strictly a “surface” phenomenon. In fact, the emulsion always has some 
nonzero thickness, and it is important to consider the possible effects 
of this thickness on the imaging process. 

As a typical example, the Kodak Spectroscopic Plate type 649F 
widely used in holography has a typical emulsion thickness of about 
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Figure 8-26 Effects of film nonlinearities on the reconstructed images. (a) Recording 
geometry; (b) images obtained when t, = 0.5; and (c) images obtained 
when t = 0.24. 


16 microns (1 micron = 10~° meter). At first glance this might appear to 
be an entirely negligible dimension; however, it should be remembered 
that the period of the fringes incident on the surface of the emulsion may 
often be comparable with one optical wavelength, which for the usual 
red line of the HeNe laser is about 0.6 micron. Thus the emulsion thick- 
ness may sometimes exceed 20 fringe periods. The ‘‘volume’’ aspects of 
the recording process must certainly be considered in such cases. 
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Our treatment of this subject will closely follow that of Collier [Ref. 
8-10] to which the reader is referred for additional details. 


An elementary hologram 


We consider first the very simple case of a plane reference wave and a 
plane object wave incident on an emulsion of nonnegligible thickness. 
These two simple waves may be regarded as generating an elementary 
hologram. 

With reference to Fig. 8-27, it is assumed for simplicity that the 
two wave normals (represented by arrows) are each at angle @/2 to the 
surface normal, the total angle between them being 6. Wavefronts, or 
successive lines of zero phase, are shown dotted; the wavefronts of any 
one wave are separated by a normal distance equal to the optical wave- 
length. Along the lines (i.e., points in this two-dimensional figure) within 
the emulsion where the wavefronts of the two waves intersect, the two 
amplitudes add in phase, yielding a large exposure. As time progresses, 
the wavefronts move in the direction of their respective wave normals, 
and the lines of constructive interference move through the emulsion, 
tracing out planes of high exposure. Simple geometry shows that these 
planes bisect the angle @ between the two wave normals and that they 
occur periodically throughout the emulsion. 


“Object” 
wave normal 


“Reference” 
wave normal 


Figure 8-27 Recording an elementary hologram with a thick emulsion. 
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Figure 8-28 Reconstruction geometry. 


If the photographic plate is developed, silver atoms will appear in 
platelets along the planes of high exposure. With the help of the heavy 
triangle in Fig. 8-27, we can readily show that the spacing A of the silver 
platelets satisfies 


2A sins = X (8-83) 


“= 


The Bragg condition 


Suppose that we attempt to reconstruct the original object plane wave 
by illuminating the elementary hologram with a reconstruction plane wave. 
The question naturally arises as to what angle of illumination should be 
used to obtain a reconstructed object wave of maximum intensity. To 
answer this question, we may regard each silver platelet as a partially 
reflecting mirror which diverts part of the incident plane wave in accord- 
ance with the usual laws of reflection. If the plane-wave illumination is 
incident on the platelet structure at angle y, as shown in Fig. 8-28, then 
the reflected plane waves will travel in the direction indicated. However, 
if the various reflected plane waves are to add in phase, then it is essential 
that the path lengths traveled by waves reflected from adjacent platelets 
differ by precisely one optical wavelength!! With reference to the figure, 


1Path-length differences of any integer number of wavelengths will yield con- 
structive interference. We consider here only the first-order condition corresponding 
to one wavelength difference. 
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simple geometry shows that this requirement will be satisfied only if the 
angle of incidence y satisfies the so-called Bragg condition, 


: ae ¥ 
siny = + si (8-84) 


Comparison of Eqs. (8-83) and (8-84) now demonstrates that maximum 
intensity will be obtained only if 


Oy 2 or 


|e ES I) 


(8-85) 


This result is a very important one, for it indicates that to obtain a 
maximum-brightness reconstruction of the original object wave the hologram 
should be illuminated by a duplication ofthe original reference wave 
(¥ = +6/2). This reconstructed wave generates a virtual image of the 
object. We also see that by illuminating the hologram with the original 
object wave, a duplication of the reference is obtained (YW = —6/2). In 
addition, the Bragg condition is satisfied by waves traveling in the anti- 
directions of the original waves. If the hologram is illuminated by a wave 
traveling in the antidirection of the reference wave [Wy = —(a — 6/2)], 
then the object plane wave is generated traveling in the direction opposite 
to which it was incident. This “conjugate” object wave leads to a real 
image of the original object. Similarly, a real image of the reference 
may be obtained by illuminating with the conjugate of the object wave 
a, — 0/2). 

The main point of this entire discussion is that when the emulsion is 
“thick,” maximally bright real or virtual images will be obtained only if 
the hologram is properly illuminated. In order to obtain the virtual image, 
the hologram should be illuminated by a duplication of the reference wave. 
In order to obtain a real image, the hologram should be illuminated (from 
the opposite side) by the conjugate of the reference wave. These results 
are indicated in Fig. 8-29 for the simple case of a diverging spherical 
reference wave. 

Finally, it should be recognized that any given emulsion may behave 
either as a surface diffraction grating typical of a thin emulsion, or as 
a volume collection of platelets typical of a thick emulsion. The question 
as to whether surface or volume effects predominate is dependent on 
the thickness of the emulsion and on the angular spectrum of plane waves 
to which it is exposed. As a general rule of thumb, the emulsion may 
be regarded as thin if its thickness L is much less than the shortest 
periodicity A of platelets; likewise, it may be regarded as thick if L > A. 
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Figure 8-29 Obtaining real and virtual images from a “thick” 
hologram. (a) Recording the hologram, (6) obtaining 
the virtual image, and (c) obtaining the real image. 


There is, of course, an intermediate range where Bragg effects are present 
but not particularly noticeable. 
8-8 GENERALIZATIONS 


Brief discussions of a number of generalizations of our previous analyses 
are now presented. The interested reader should consult the references 
cited for more complete treatments of this subject matter. 
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Holography of moving objects [Ref. 8-27] 


Our previous analyses have in all cases assumed that the reference and 
object field distributions are constant with time, at least for the duration 
of the exposure interval. In some cases the object may be moving with 
time, with the result that the pattern of interference changes during the 
exposure interval. We consider now the generalization of the theory to 
include the effects of object motion. 

We assume that the reference source is monochromatic, and there- 
fore that the reference fields incident on the film may be written in com- 
plex representation as 


U,(x,y;t) = A(a,y) exp (—J2r»,t) (8-86) 


The object fields we allow to change with time, using a complex 
representation 

U.(x,y;t) = a(z,y;t) exp (—j2mvet) (8-87) 
The total exposure is thus 
T/2 


E(e,y) = E,(ey) + Boley) + AC) [oan 
+ A*(x,y) thee 4 a(z,y;t) di (8-88) 


a*(x,y;t) dt 


where H, and E#, are the exposures that would be produced by the reference 
and object alone, and T is the exposure interval. 

Restricting attention to the one component of exposure which is 
ultimately responsible for the virtual image, we write 


E’(x,y) = A*(x,y) oat a(zx,y;t) dt (8-89) 


This equation contains all the information necessary to predict the effects 
of object motion on the image. However, in many cases it is more con- 
venient to cast this result in an equivalent frequency-domain representa- 
tion. Specifically, we may write (using Table 2-1) 


2 t 
E’(ay) = A729) ie rect @ a(x,y;t) dt 
=A*@y)T is sine(T'v)Ga(a,y;v) dy (8-90) 
where G,(z,y;v) is defined by 
G.(2,y;v) = ee a(z,y;t) exp (—j2mvt) dt (8-91) 


and the generalized Parseval’s theorem [Ref. 8-28] has been used in 
writing the final equality. 
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From Eq. (8-90), it is evident that the effect of a uniform time 
exposure of duration 7’ seconds is entirely equivalent to that of a linear 
filtering operation with (temporal) transfer function 


H(y) = sine(T'v) (8-92) 


applied to a(z,y;t). The interpretation of this result is as follows: if the 
radiation incident on the film from a given object point is (uniformly 
over the film) » cycles/sec different than that of the reference, then the 
intensity of the corresponding image point will be suppressed by a factor 
sinc?(7'v). , 

This result will find application in our discussion of vibration 
analysis in Sec. 8-9. For other applications the reader may consult the 
original reference [Ref. 8-27]. 


Holography with spatially incoherent light 


While holography was originally conceived as a means of coherent image 
formation, certain techniques now exist by means of which holograms 
of incoherently illuminated objects may be recorded. The extension of 
holographic techniques to the incoherent case was first suggested by 
Mertz and Young [Ref. 8-29]. The theory and practice of incoherent 
holography were later extended by Lohmann [Ref. 8-30], Stroke and 
Restrick [Ref. 8-31], and Cochran [Ref. 8-32]. 

The light from any one point on an incoherently illuminated object 
will not, of course, interfere with the light from any other object point. 
Nonetheless, if by means of some suitable optical trick the light from 
each object point is split into two parts, then it is possible for each pair 
of waves of common origin to interfere and form a fringe pattern. Thus 
each object point may be encoded into a suitable system of fringes, and 
if the encoding is a unique one, with no two object points generating 
identical fringe patterns, then in principle an image of the object can 
be obtained. 

While a variety of optical systems for achieving the required splitting 
of the object waves are known [Refs. 8-30, 8-31], we illustrate here with 
one particular system suggested by Cochran [Ref. 8-32]. As shown in 
Fig. 8-30, the system consists of a triangular interferometer, in which 
are placed two lenses Li and L» of different focal lengths f; and f2. We 
assume here that both lenses are positive, although a combination of 
one positive and one negative lens may also be used. The lenses are 
separated by a path length fi + fs, their focal points coinciding at the 
point P in the figure. Plane A and plane B each lie a path length fi from 
lens L; and a path length f, from lens Ly. 
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Figure 8-30 Triangular interferometer for incoherent holography. 


Light may travel from plane A to plane B along either of two paths, 
one clockwise around the interferometer and the second counterclock- 
wise. Considering first the clockwise path, light travels a distance f; from 
plane A to lens L; by means of a reflection at the beam splitter BS. From 
L; to Ly the path length is f: + fo, and from Ly to plane B (again by 
means of a reflection at BS) the path length is fo. Because of the par- 
ticular choice of path lengths in relation to the focal lengths f; and fo, 
plane A is imaged onto plane B; due to the particular sequence in which 
L, and LZ, are encountered on this path, the imaging is performed with a 
magnification M@, = —fo/f1. 

For the counterclockwise path, light is in each case transmitted 
(rather than reflected) by the beam splitter. Again plane A is imaged 
onto plane B, but for this path the lenses are encountered in opposite 
sequence, and the magnification is WZ. = —fi/fo. 

Consider now the single point 0 (Fig. 8-30) of an incoherent object 
located at distance z from plane A. Regarding the light from that one 
point as providing a phase reference, we may express the resulting 
spherical wave (wavefront Wo of Fig. 8-30) incident on plane A as the 
complex function 


U.(z,y) = Us exp E = (2? + v)| (8-93) 


At plane B we find two spherical waves (wavefronts W; and W» of Fig. 
8-30), one magnified by M@, = —f2/f: and the second by My = —fi/fo. 
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Thus the total amplitude is 
ae REN rall 
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The corresponding intensity distribution is 


4 
ssey) = [U? + [Va? + 210 Ual eos ] = (2) ct + v9) + 4 | 
(8-95) 
where we have used the relation 


1 1 i 2 


M2— M2 fife? 


If a photographic plate is exposed by the intensity pattern (8-95) 
and processed to produce a positive transparency with amplitude trans- 
mittance linearly proportional to exposure, the resulting transmittance 
may be written 


t(x,y) = ty + B’U,U exp ( |= (Ase z ne ) oe = v |} 


+ B’UTU, exp {=i |= (A) (a? + | (8-96) 


We recognize the second and third terms as the transmittance functions 
of negative and positive lenses [cf. Eq. (5-10)], respectively, each of 
focal length 


fvfe? 

b ft — fe 2 (8-97) 
Thus, if the transparency is illuminated by a coherent source, both a 

virtual and a real image of the original point source are formed. 
Generalizing now to an object consisting of a multitude of mutually 
incoherent point sources, each such source generates its own fringe 
pattern on the film. Since the various sources are not coherent, the total 
intensity is found simply by adding the various intensity patterns so 
generated. The (x,y) coordinates of each point source determine the 
center of the corresponding pattern of fringes, and therefore fix the (x,y) 
coordinates of the real and virtual images. Similarly, the z coordinate 
of the point source influences the focal length of its contribution to the 


transmittance function [see Eq. (8-97)], and the image formed is thus a 
three-dimensional one. 
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Although the possibility of using incoherent illumination rather than 
coherent illumination is an attractive one in many applications, there 
exists a serious practical problem which has limited the usefulness of 
incoherent holography to date. The problem arises because each ele- 
mentary fringe pattern is formed by interference of two extremely tiny 
portions of the total light incident on the film. Whereas for coherent 
holography, light from each object point interferes with all the light 
contributed by the reference, for incoherent holography the interfering 
waves represent only a minute fraction of the total light. Thus a large 
“bias” exposure is always present. As a consequence of this bias problem, 
incoherent holography has been successfully applied only to objects com- 
posed of a relatively small number of resolution cells. The future utility 
of incoherent holography rests to a large degree on the invention of 
simple techniques for overcoming this problem. 


Reflection holograms 


By a suitable modification of the techniques of coherent holography, it 
is possible to record holograms in (approximately monochromatic) 
coherent light and to obtain images under “white light” illumination. In 
this case the reconstructed wavefronts are obtained by reflection from 
the hologram, rather than by transmission through it. The technique 
utilizes the three-dimensional properties of the photographic emulsion 
and is closely related in many fundamental respects to the Lippmann 
method of color photography [Ref. 8-33]. 

The basic ideas behind this method of holography were first pre- 
sented in 1962 by Y. N. Denisyuk in the Soviet Union [Ref. 8-34]. The 
concept was not fully appreciated in the United States until the publica- 
tions of Stroke and Labeyrie [Ref. 8-35] and Lin et al. [Ref. 8-36] in 1966. 
A rather complete analysis of this type of holography has been presented 
by Leith et al. [Ref. 8-37]. 

To record a reflection hologram, the coherent object and reference 
waves are introduced from opposite sides of the photographic emulsion, 
as shown in Fig. 8-3la. As implied by our previous discussion in Sec. 
8-7, the interference of the two waves yields (after the development 
process) a transparency containing stratified layers of metallic silver 
which act as reflecting planes. Since the angle 6 is near 180°, the reflecting 
planes (which bisect @) run nearly parallel to the surface of the emulsion. 
The spacing of the planes is, from Kq. (8-83), 


(8-98) 
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Figure 8-31 Reflection holography. (a) Recording the hologram; (b) 
obtaining a virtual image; and (c) obtaining a real 
image. 


To obtain a virtual image of the original object, we must illuminate 
the developed transparency with a duplication of the reference wave. As 
indicated in Fig. 8-316, the original object wavefront is then reproduced, 
which must in this case mean that the image is formed from reflected light. 
We also note that, for a given illumination and observation geometry, 
there is only a single reconstructing wavelength which will satisfy the 
Bragg law. Thus if white illumination is used, light of the wavelength 
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required by the Bragg law is reflected, while the remainder passes through 
the emulsion or is partially absorbed. 

To obtain a real image, the transparency is illuminated from the 
opposite side, and again the image forms from single-color reflected light, 
as shown in Fig. 8-31c. 

In principle we would expect the color of the images to be about the 
same as the color of the ight used for recording the hologram. In practice, 
the wavelength of the reflected light is shorter than that of the exposing 
light, the reason being that the emulsion shrinks during the development 
and fixing processes, and the silver layers are more closely spaced than 
anticipated. If duplication of the original wavelength is desired (as 
indeed it is for multicolor holography [Ref. 8-36]), then steps can be 
taken to prevent shrinkage. Omission of the fixing process eliminates 
most shrinkage, but the developed plates then slowly deteriorate. Perhaps 
the most satisfactory solution lies in certain techniques for reswelling 
the emulsion after the fixing process. By careful application of these 


Figure 8-32 Photograph of the virtual image produced by a reflection hologram 
illuminated by white light. 
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techniques, the emulsion can be made to reflect at any predetermined 
wavelength within a broad spectral region. 

In Fig. 8-32 a photograph of the virtual image produced by a reflec- 
tion hologram is shown. The hologram was recorded with the 6328 A 
light of a helium-neon laser. Without application of emulsion-swelling 
techniques, the (fixed) hologram reflects green light. 


8-9 APPLICATIONS OF HOLOGRAPHY 


The improvements and refinements of the wavefront-reconstruction 
process which developed during the early 1960s have led to a growing 
number of applications of holography. We consider here several of the 
most significant of these applications. Attention is limited to optical 
holography; the reader should also be aware that applications of the 
process are developing in the microwave and radio-frequency regions of 
the spectrum [e.g., Ref. 8-38], as well as in the field of acoustic imaging 
[Ref. 8-39]. 


Microscopy 


Microscopy has historically been the potential application of holography 
which motivated most of the early work on wavefront reconstruction; 
it was certainly the chief motivating force behind the early works of 
Gabor [Refs. 8-1, 8-3, 8-4] and El-Sum [Ref. 8-6]. 

In the present state of development, a sufficient number of new appli- 
cations of holography have been found so that microscopy can no longer 
be said to be the prime motivating force that it once was. In addition, 
while the use of holograms for microscopy has been amply demonstrated, 
it is clear that the new techniques are not serious competitors with the 
conventional microscope in ordinary, run-of-the-mill microscopy. None- 
theless, there do exist two areas in which holography offers a unique 
potential for microscopy. First is in the area of high-resolution volume 
imagery, which is treated in more detail in a later section. In conventional 
microscopy, high lateral resolution is obtained only at the price of a 
limited depth of focus. Holograms offer the potential of microscopy with 
relatively enormous depth of focus, a consequence of the three-dimen- 
sional imaging capability. 

A second application in the field of microscopy, which must be said 
to be at too early a stage for full evaluation, is the potential use of 
holograms to construct an x-ray microscope. The basic concept here is 
the use of x-ray illumination for the recording of a hologram and the 
use of optical illumination during the reconstruction process. Enormous 
magnifications can be achieved by this change of wavelength [cf. Eq. 
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(8-46)]; but more important, the use of x-ray wavelengths for illumination 
could yield resolutions of a few angstroms (1 A = 107 meter). Such 
resolutions are comparable with those achievable in electron microscopy, 
but the use of x-rays might yield less sample heating than is produced by 
an electron beam, and might also allow operation without the vacuum 
required in electron microscopy. The success or failure of holographic 
x-ray microscopy undoubtedly rests on future developments in the source 
and sensor technology. 


Interferometry 


Holography offers the capability of performing several rather unique 
kinds of interferometry. The ability to perform interferometry by wave- 
front reconstruction rests on the fact that the images formed are coherent, 
with well-defined amplitude and phase distributions. Any use of holog- 
raphy to achieve the superposition of two coherent images will result in 
a potential method of interferometry. 

The most powerful holographic interferometry techniques are based 
on a property, emphasized by Gabor et al. [Ref. 8-40], that, by means of 
multiple exposures of holograms, coherent additions of complex wave- 
fronts can be achieved. This property can be easily demonstrated as 
follows: Let a photographic emulsion be exposed sequentially by N 
different intensity distributions 91, 52, . . . , Jy. The total exposure to 
which the emulsion is subjected may be written 


N 
E= > Pegs (8-99) 
k=1 


where 7, T2, . . . , Ty are the N individual exposure times. Now suppose 
that during each individual exposure interval, the incident radiation is 
the sum of a fixed reference wave A(z,y) and an object wave a;(2,y) 
which changes from exposure interval to exposure interval. The total 
exposure becomes 


N N N N 
E=)Y TA? + Y Tila? + Y TA*a, + ¥ T,Aaz (8-100) 
k=1 k=1 k=1 k=1 


Assuming operation in a linear region of the t-E curve of the emul- 
sion, we find a component of transmittance 


N 
t. = B » T,A*ay (8-101) 

k=1 
and a component tj=6 ) TiAa; (8-102) 


256 / INTRODUCTION TO FOURIER OPTICS 


(0) 


Figure 8-33 Hologram interferometry with a Q-switched 
ruby laser. [By permission of R. E. Brooks, 
L. O. Heflinger, and R. F. Wuerker.| 
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From Eq. (8-101) it is clear that illumination of the transparency by a 
wavefront A will generate a transmitted field component proportional to 
a weighted sum of the complex wavefronts aj, a2, . . . , ay. AS a conse- 
quence, N coherent virtual images of the original objects producing the 
a’s will be linearly superimposed and will, of course, mutually interfere. 

In a similar fashion, illumination of the transparency by a wavefront 
A* will, from Eq. (8-102), generate N coherent real images which will 
likewise interfere. 

The most dramatic demonstrations of the potential of this type of 
interferometry have been performed by Brooks et al. [Ref. 8-41] using a 
Q-switched ruby laser. Figure 8-33 shows two photographs obtained in 
each case by double exposure of a hologram with two laser pulses. In the 
case of part (a) of the figure, the first pulse records a hologram of only a 
diffuse background, while the second pulse records a hologram of a bullet 
in flight in front of the same diffuse background. The shock waves gen- 
erated by the bullet produce changes in the refractive index of the air. 
As a consequence, the two images of the diffuse background—one 
recorded in the absence of the bullet and the second recorded through the 
refractive-index perturbations of the air—will mutually interfere, pro- 
ducing interference fringes which outline the shock waves generated by 
the bullet. These fringes have the appearance of being fixed in three- 
dimensional space around the bullet. 

Part (b) of the same figure is a similarly obtained image of an 
incandescent lamp. During the first exposure the filament is off, and 
again a hologram of a diffuse background is recorded, this time through 
the glass envelope of the lamp. The filament is then turned on, and a 
second laser pulse exposes the hologram. The incoherent light generated 
by the lamp does not interfere with the laser light, so the filament does 
not appear lighted in the final image. However, the heating of the gases 
within the envelope has resulted in changes of index of refraction, which 
again will generate fringes of interference in the final image, outlining 
the patterns of gas expansion. It should be emphasized that these inter- 
ference fringes have been obtained in the presence of the optically im- 
perfect glass envelope, a feat which would be impossible by other “clas- 
sical’? methods of interferometry. 


Vibration analysis 


A holographic technique for performing vibration analysis was first pro- 
posed and demonstrated by Powell and Stetson [Ref. 8-42]. This tech- 
nique may be regarded as a generalization of the multiple-exposure 
interferometry method to the case of a continuous time exposure of a 
vibrating object. 
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Figure 8-34 Recording a hologram of a vibrating object. 


An analysis of this technique may be easily performed with the help 
of the temporal filtering property of time-exposed holograms, as derived 
in Sec. 8-8. With reference to the geometry of Fig. 8-34, we consider a 
point at coordinates (z,,y.) on a planar object which is vibrating sinus- 
oidally with angular frequency 2. The peak amplitude of the vibration 
is represented by m/(2,,y.), and its fixed phase by u(a.,y.). The light 
incident at film coordinates (x,y) from that object point may be regarded 
as having the time-varying phase modulation 


Att) = = (cos 6; + cos 62)m(2o,Yo) Cos [Qt + u(2.,Yo)] (8-103) 


where \ is the optical wavelength of the illuminating source, 6: is the 
angle between the vector displacement and the line joining (2,,y.) with 
(x,y), and 62 is the angle between the vector displacement and the direc- 
tion of propagation of the light incident at (x,y). As a consequence of 
this sinusoidal phase modulation, the frequency spectrum F(v) of the 
time-varying phasor light incident at (#,y) may be expressed in the usual 
Bessel function expansion of a sinusoidally phase-modulated wave, 


6 
ae = S i | 2 pce aie (zo) | 5 ( ss =) (8-104) 


k=—o X 


When the exposure time is much longer than the vibration period (that is, 
T > 21r/), only the k = 0 or “carrier” term falls within the passband 
of the transfer function (8-92). If the variations of modulation depth 
introduced by the term cos 6; are nearly independent of (x,y) [that is, 
if the angle subtended by the film at (x,,y.) is small], then the amplitude 


WAVEFRONT-RECONSTRUCTION IMAGING, OR HOLOGRAPHY / 259 


of the image of (a,,y.) will be suppressed by the factor 


2 
se E (cos 6; + cos oan (co) (8-105) 


and the intensity will be suppressed by the square of this factor. Thus the 
intensity of the image depends, at each point, on the depth m(a,,y.) of 
the vibration at the corresponding object point. 

Figure 8-35 shows images of a vibrating diaphragm obtained experi- 
mentally by Powell and Stetson. In part (a) of the figure the diaphragm 
is vibrating in its lowest order mode, with a single vibration maximum in 
the center of the diaphragm. In part (b) the diaphragm is vibrating in a 
higher order mode, with two vibration maxima. By counting the number 
of fringes from the edge of the diaphragm to any point in question, it is 
possible, with the help of Eq. (8-105), to determine the vibration ampli- 
tude at that point. 


High-resolution volume imagery 


As mentioned previously in the section on microscopy, one of the most 
useful properties of holographic images is their enormous potential depth 
of focus. If a conventional imaging system (e.g., a microscope) is designed 
to have high transverse resolution, then for very fundamental reasons 


(0) 

(a) 

Figure 8-35 Holographic images of a diaphragm vibrating in two different modes. 
[By permission of R. L. Powell and K. A. Stetson. 
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there is only a limited volume of the object space which can be brought 
‘Gn focus” at one time. It is, of course, possible to explore a large volume 
in sequence by continuously refocusing to explore new regions of the 
object volume, but such an approach is often unsatisfactory if the object 
is a dynamic one, continuously in motion. 

A solution to these problems may be obtained by recording a holo- 
gram of the object using a pulsed laser to obtain short exposure times. The 
dynamic object is then “frozen” in time, but the recording retains all 
the information necessary to explore the full object volume. The hologram 
is simply illuminated and the real or virtual image is explored with an 
auxiliary optical system. Sequential observation of the image volume is 
now acceptable because the object (i.e., the holographic image) is no 
longer dynamic. 

This approach has been fruitfully applied by C. Knox in the micros- 
copy of three-dimensional volumes of living biological specimens [Ref. 
8-43], and by Thompson, Ward, and Zinky in the measurement of particle- 
size distributions in aerosols [Ref. 8-44]. The reader may consult these 
references for further details. 


Contour generation on three-dimensional objects 


The interference of multiple coherent images described previously has 
also led to the development of techniques for obtaining three-dimensional 
images with superimposed constant-range contours. These techniques 
are applicable to the problems of cross-section tracing and contour 
mapping. Two distinctly different techniques have been proposed by 
Hildebrand and Haines [Ref. 8-45]. In the first of these techniques, the 
object is illuminated by two mutually coherent but spatially separate 
point sources. The two object illuminations may be applied simulta- 
neously, or the hologram may be double-exposed, with a different position 
of the object source for each exposure. 

If the pattern of interference between the two object-illumination 
sources is considered, it is found to consist of bright and dark interference 
fringes which follow hyperbolas of constant path-length difference, as 
shown in Fig. 8-36. If the object is illuminated from the side and a 
hologram recorded from above, then depth contours are readily seen on 
the reconstructed image. Identical results are obtained whether the two 
sources are used simultaneously in a single exposure or individually in a 
double exposure, for in either case the results of the two illuminations 
are added coherently. 

The two-source method of contour generation suffers from the 
requirement that the directions of illumination and observation must 
differ by nearly 90°. Thus if the object has significant reliefing, shadows 
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will be cast and parts of the object will simply not be illuminated. This 
deficiency is overcome by the two-frequency method of contour genera- 
tion. In this case the object and the reference illumination each contain 
two frequency components. When the resulting hologram is illuminated 
by single-frequency light, two images with slightly different position and 
magnification are produced. These two images will interfere, and for cer- 
tain geometries the resulting image contours will be accurate indications 
of depth. We do not dwell on a detailed analysis of this more complicated 
case; the interested reader may consult the original reference for further 
details [Ref. 8-45]. Figure 8-37 illustrates the results of contour mapping 
by the two-frequency method. In part (a) we see a holographic image of a 
quarter, illuminated in the usual manner with single-frequency light. 
When two-frequency light is used to record the hologram, the image of 
part (b) is obtained. In this case the two frequencies were obtained from 
two different lines of an argon laser. The two laser lines were separated by 
65 A and the resulting contours on the image are spaced by 0.02 mm. 


Imaging through aberrating media 


In many cases of practical interest, an optical system may be required to 
form images in the presence of uncontrollable aberrations. These aberra- 
tions may result from imperfections of the image-forming components 
themselves, or they may be introduced by an external medium such as the 
atmosphere. The techniques of holography offer several unique advantages 
for problems of imaging in the presence of such aberrations. We discuss 
here three distinctly different holographic techniques for obtaining high 
resolution in the presence of severe aberrations. 
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Figure 8-386 Contour generation by the two-source 
method. 


Reference 


262 / INTRODUCTION TO FOURIER OPTICS 


(0) 


Figure 8-387 Contour generation by the two-frequency 
method. [By permission of B. P. 
Hildebrand and K. A. Haines.] 
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The first technique [Refs. 8-46, 8-47] of interest is applicable when the 
aberrating medium is constant in time and movable in space. As illustrated 
in Fig. 8-38, a hologram of the aberrated object waves is recorded with an 
unaberrated reference. If the hologram is illuminated in such a way as to 
generate a real image of the object, then a real image of the aberrating 
medium will likewise form between the hologram and the image plane. 
Now if the original object wave incident on the aberrating medium is 
represented by Uo(é,7), and if the amplitude transmittance of the aberrat- 
ing medium is exp [jW(£,n)], then the wave produced at the real image of 
the aberrating medium is the conjugate wave Uf(£,n) exp [—jW(&,n)]. 
Now note if the original aberrating medium is inserted during recon- 
struction such that it exactly coincides with its conjugate image, then the 
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Figure 8-38 Use of the original aberrating medium for compensating 
aberrations. (a) Recording the hologram; (b) reconstructing 
the wmage. 
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wave transmitted by the medium becomes 


U*(E,n) exp [—jW(E,n)] exp [7W(E,2)] = Us (én) 


Thus the aberrations have been exactly canceled, and an aberration-free 
image can be formed.! 

This technique would undoubtedly find application in cryptography, 
or the secure encoding of messages. The message to be encoded serves as 
the object. A diffuser is inserted between the object and the film plane. 
Illumination of the hologram in the ordinary manner will produce an 
image which is too badly aberrated to be read. However, the one person in 
possession of the same diffuser used during the recording process can, by 
properly inserting his ‘‘decoding plate,’”’ obtain an unaberrated image. 

A second technique of interest is illustrated in Fig. 8-39. In this case 
the aberrating medium may be fixed in space (i.e., immovable) but must 
be unchanging in time (or at best, very slowly changing). To obtain high- 
resolution images in the presence of aberrations, a hologram of the aber- 
rated wavefronts produced by a point-source object is recorded, using an 
unaberrated reference wave. This hologram may now be used as a “‘com- 
pensating plate” to enable a more conventional optical system to form an 
aberration-free image. Let the waves incident on the photographic film 
from the single point source be represented by exp [jW(z,y)]. We have 
assumed here that the aberrating medium is sufficiently close to the film 
so that intensity variations are not significant. Now the portion of holo- 
gram transmittance which normally contributes the real image is propor- 
tional to exp [—jW(zx,y)]. Thus if we replace the point source by a more 
general object, and reinsert the hologram in the same position where it 
was originally recorded, we find that the curvature of the object waves 
reaching the hologram is canceled on passage through the hologram, 
with the waves from different object points producing plane waves 
traveling at different angles. The lens then forms an unaberrated image in 
the usual manner. 

This technique will work well over only a restricted field of view, for 
if an object point is too far from the position of the original point source 
used in recording the hologram, the aberrations imparted to its wave may 
differ from the aberrations recorded on the hologram. This restriction is 
least severe when the hologram is recorded very close to the aberrating 
medium. Upatnieks et al. [Ref. 8-48] have successfully applied this 
technique to the compensation of lens aberrations, an application to 
which it is well suited. 


1'The accuracy with which the aberrating medium must be positioned during 
reconstruction may be very great indeed. To image through a finely ground diffuser, 
errors of even a few microns cannot be tolerated. 
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Figure 8-39 Use of a hologram compensating plate. (a) Recording the 
compensating plate; (b) cancellation of the aberrations. 


A third technique which may be applied to imaging through media 
which are movable or immovable, and time-varying or time-independent, 
is accomplished by passing both the reference wave and the object wave 
through the same aberrating medium. As indicated in Fig. 8-40, the 
lensless Fourier transform recording geometry is used. For simplicity it is 
assumed that the aberrating medium is located immediately in front of 
the recording plane. The reference and object waves reaching the film may 
thus be written as A(z,y) exp [7W(az,y)] and a(z,y) exp [7W(z,y)]. Inter- 
ference of the two aberrated waves leaves an interference pattern which is 
unaffected by the presence of the aberrating medium, 


S(x,y) = |A(x,y) exp [JW(a,y)] + a(z,y) exp [JW (x,y)]]? 
= |Al? + |al? + A*a + Aa* 


Thus the hologram is unaffected by the presence of the aberrating medium. 
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Figure 8-40 Aberration-free imaging when the reference 
and object waves are identically aberrated. 
(a) Recording the hologram; (b) obtaining 
the image. 


Again the technique will work over only a limited object field, since 
points too far from the reference may produce waves with aberrations 
different than those of the reference wave.! The working field is largest 
when the aberrations are introduced near the recording plane. For more 
details regarding this process, the reader may consult Ref. 8-49. 

Figures 8-41 through 8-44 illustrate experimental results obtained 


1 More specifically, a given point on the object will be imaged free of aberrations 
only if the rays drawn from that point, and from the reference to any point on the 
film, undergo identical path-length delays. 
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using this technique. The object was in this case a transparency contain- 
ing both a sign and a bright reference point, as shown in Fig. 8-41. The 
aberrating medium was common shower glass, as shown in Fig. 8-42. 
Figure 8-43 shows the wavefront-reconstruction images obtained in the 
presence of the aberrating medium. For comparison, Fig. 8-44 shows the 
conventionally formed images, obtained by replacing the film with a 
positive lens which directly forms the image in the presence of the aber- 
rating medium. Resolution in the conventionally formed image has been 


Figure 8-41 The object and reference. 
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Figure 8-42 The aberrating medium. 
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Figure 8-43 Wavefront-reconstruction images obtained 
in the presence of the aberrating medium. 


<4 -. Ps ‘ as oe fe rt ; a Wax i} 
Figure 8-44 Conventionally formed image obtained in 
the presence of the aberrating medium. 


totally destroyed by the aberrations. The wavefront-reconstruction 
images, however, have suffered only minor degradations. 


PROBLEMS 
8-1 Show that when a hologram of a planar object is recorded in a plane parallel 


with the object, the resulting images form in planes parallel with the hologram. 
(For simplicity, assume a plane reference wave.) 


8-2 


8-3 


8-4 


8-5 


8-6 


8-7 
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A hologram is recorded with light from an argon laser at 4880 A and the images 
are formed with light from a helium-neon laser at 6328 A. 

(a) Assuming zp = ©, z, = », and z = 10 cm, what is the image distance 2? 
(b) Assuming zp = ~, z, = 2z,, and z = 10 cm, what is z;? What is M? 


Show that when 2 = A; and zp = z;, there results a virtual image with unity 
magnification, whereas when \2 = \; and z, = —z,, there results a real image 
with unity magnification. 


A hologram is recorded in the manner illustrated in Fig. 8-10. The photographic 
transparency is then enlarged by a factor m. Show that the images formed from 
the enlarged hologram form at distances z; given by 


1 2 al Ae eo 
a= =u + 
Se \1zZ-m? A1Z0™m? 


Nozi 


and have magnification M = 


MXiZo 


The following table lists approximate cutoff frequencies for the MTFs of several 
types of film: 


Lines/mm 
Kodak Tri-X 50 
Kodak High-Contrast Copy 60 
Kodak SO-243 300 
Agfa Agepan FF 600 


Assume illumination of 6328 A wavelength and a lensless Fourier transform 
recording geometry with reference and object 10 cm from the film. For each 
film, estimate the radius of the circle (about the reference point) outside of which 
object points will fail to produce corresponding image points. 


A certain film has a nonlinear t-E curve which, over its region of operation, may 
be described by 
t — th ~ BE; 


where Z, represents the variations of exposure about the reference exposure. 
(a) Assuming a reference wave A exp (—j2ray) and an object wave 


a(x,y) exp [—j¢(z,y)] 


at the film, find an expression for that portion of the transmittance which 
generates the two first-order images. 

(b) To what does this expression reduce if A >a? 

(c) How do the amplitude and phase modulations obtained in parts (a) and (6) 
compare with the ideal amplitude and phase modulations present when the film 
has a linear t-E curve? 


A strong reference wave A exp(—j2ray) and a weak object wave a(x,y) 
exp [—j¢(z,y)] interfere on an ideal hard-limiting film (i.e., a film with the t-E 
curve shown in Fig. 8-22). A perfect copy of the resulting hologram is made by 
contact printing, yielding two identical holograms. 
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8-8 


8-9 


One of the holograms is illuminated by coherent light, and an optical system 
is inserted to image that hologram onto the second identical hologram, as shown 
in Fig. P8-7. A stop is inserted in the rear focal plane of the imaging lens such 
that the only component of light transmitted to the image plane is that arising 
from one of the two first-order transmittance terms of hologram 1. Show that, 
under the assumption a < A, one of the components of light transmitted by 
hologram 2 has an undistorted amplitude modulation a(z,y) and an undistorted 
phase modulation ¢(z,y), in spite of the fact that a hard-limiting film was used. 


nn Hologram 
Kr #2 
Lens 
ae ce Stop 
Figure P8-7 


The lensless Fourier transform geometry of Fig. 8-18 is used to record a holo- 
gram of a square transparency object of width L. The amplitude transmittance 
of the object is t.(x.,¥.) and the distance from object to film is z. The images are 
obtained with a positive lens of focal length f, as shown in Fig. 8-19. 

(a) What is the magnification M of the two first-order images? 

(b) Show that the amplitude of the zero-order image term (i.e., the light lying 
between the two first-order images) is proportional to the autocorrelation func- 


tion of 
Zo Yo 
Ls || Sage ¢ ° a 
€ ¥) a 3 ane ty | 


plus a central diffraction-limited spot. 
(c) How far from the center of the object transparency should the reference 


point source be placed in order to assure no overlap of the zero-order light with 
the first-order images? 


It is proposed to record an x-ray hologram using radiation of wavelength 1 A 
(10~% meter), and to reconstruct the images optically using light of wavelength 
6000 A. The lensless Fourier transform recording geometry is chosen, as shown 
in Fig. P8-9 (top). The width of the object is 0.1 mm, and the minimum distance 
between object and reference is chosen to be 0.1 mm to assure that the twin 
images will be separated from the “on-axis” interference. The x-ray film is placed 
2 cm from the object. 

(a) What is the maximum spatial frequency (cycles/mm) in the intensity 
pattern incident on the film? 

(b) Assume that the film has sufficient resolution to record all the incident 
intensity variations. It is proposed to reconstruct images in the usual manner 
shown in Fig. P8-9 (bottom). Why will this experiment be unsuccessful? 
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APPENDIX 


A. DIRAC DELTA FUNCTIONS 


The one-dimensional Dirac delta function, widely used in electric circuit 
analysis, can be defined as the limit of a sequence of pulses of decreasing 
width, increasing height, and unit area. There are, of course, a multitude 
of different pulse shapes that can be used in the definition; three equally 
acceptable definitions are 


6(t) = lim N exp (—N?zt?) (A-1a) 
N- 

6(t) = lim N rect(Nt) (A-1b) 
N= « 

6(t) = lim N sine(Nt) (A-Ic) 
N- © 


While the 6 function is used in circuit analysis to represent a sharp, 
intense pulse of current or voltage, the analogous concept in optics is a 
point source of light, or a spatial pulse of unit area. The definition of a 6 
function on a two-dimensional space is a simple extension of the one- 
dimensional case, although there is even greater latitude in the possible 
choice for the functional form of the pulses. Possible definitions of the 
spatial 6 function include 


ee Ne) (A-2a) 

(x,y) = oul N? rect(Nz) rect(Ny) (A-2b) 

6(x,y) = ane N? sine(Nz2) sine(Ny) (A-2c) 
2 

6(x,y) = lim SE eicclN Vx? + y?) (A-2d) 
Noo TW 


pea lm N Ji 2a V a? + 4”) 
lec) == ohban 
N- © V x? a Vi 


Definitions (A-2a) to (A-2c) are separable in rectangular coordinates, 


(A-2e) 


vk) 
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while definitions (A-2d) and (A-2e) are circularly symmetric. In some 
applications one definition may be more convenient than others, and the 
definition best suited for the problem can be chosen. 

Each of the above definitions of the spatial 6 function has the following 
fundamental properties: 


| ces z=y=0 : 

2) i otherwise Soa) 
lf 6(x,y) dx dy = 1 any e > 0 (A-4) 

[ff atema@ = & y — 9) dé dn = g(x,y) (A-5) 


at each point of continuity of g 


Property (A-5) is often referred to as the sifting property of the 6 function. 
An additional property of considerable importance can be proved from 
any of the definitions (cf. Prob. 2—1a), namely, 


Nese) = a (x,y) (A-6) 


There is, of course, no reason why the 6 function cannot be defined on 
a space of higher dimensionality than two, but the properties of such 
functions are exactly analogous to their counterparts on spaces of lower 
dimensionality. 


B. DERIVATION OF FOURIER TRANSFORM THEOREMS 


In this section, brief proofs of basic Fourier transform theorems are 
presented. For more rigorous derivations, the reader should consult Ref. 
2-1 or 2-2. 


1. Linearity theorem. F{ag + Bh} = aF{g} + BF{h} 
Proof: This theorem follows directly from the linearity of the 


integrals that define the Fourier transform. 


2. Similarity theorem. If F{g(2,y)} = G(fx,fy), then 


#{g(az,by)} = — (4 in) 
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Eroer: 


F{g(ax,by)} = if g(ax,by) exp [—j2r(fxx + fry)] dx dy 
= it g(ax,by) exp = jor (4 ax + fy wv) dax dby 


at ee lal [0 
1 ix fz 
~ lab o(% _) 


3. Shift theorem. If F{g(x,y)} = G(fx,fy), then 


Fig(e — a, y — b)} = G(fx,fr) exp [—j2n(fxa + fyb)] 
Proof: 


_ If g(x — a, y — b) exp [—j2m(fxx + fry)] dx dy 


= |f o(e'y) exp {—j2alfe(e! + a) + fel’ + 5))} ae! dy’ 


= G(fx,fy) exp [—j2n(fxa + fyb)] 
4, Parseval’s theorem. If 5{g(z,y)} = G(fx,fy), then 


lf Ig (x,y) |2 di dy = ff IG fx, fr) |? dfx dfy 


Proof: 
ff lg(x,y)|? dx dy - ff g(x,y) g*(x,y) dx dy 
i i uy ay | i dé dn G(E,n) exp [j2m(aé + vn) | 
l i da dB G*(a,8) exp [—j2r(na + y8)] | 
~ ff dé dy G(E,n) I da dB G*(a,8) 


[ ff exp {j2m[z(é — a) + y(n — B)]} da dy | 
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= ff ae dn (En) [] / da a8 G*(a,8)5(E — a, n — 8) 


= [fGen ag ay 


5. Convolution theorem. If F{g(x,y)} = G(fx,fy) and 
F{h(x,y)} = H(fx,fy), then 


x I g(émh(@ — & y — 1) dé da} = G(fx,fy)H(fx,f) 
Proof: 
5 (ff g(Em)h(@ — & y — 9) dé dn} 


= If g(ém)F{h(z — £, y — n)} dé dy 


= ff a(&n) exp [-s2m(fxé + frn)] dé dn Hfxsfv) 


= G(fx,fr)H(fx,fy) 
6. Autocorrelation theorem. If F{g(x,y)} = G(fx,fy), then 


ay { I g(En)g*(E — x,» — y) dé dn} = |G(fx, fy)? 


Proof: 
F { I g(En)g*(E — 2, 1 — y) dé dy} 
= § { I g(t’ +3, o yg (Eo) ae cry} 


= ff ae dy g*(e mn) 51 g(8 +2, 9 +y)} 


Jf ae da’ g*(e'91) exp [2a (xt! + fen) IGUxfy) 
= G*(fx,fr)G(fx,fy) = |GCfx, fy) |? 
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7. Fourier integral theorem. At each point of continuity of g, 


“Ha(ay)} = FF{g(axy)} = g(x,y) 
At each point of discontinuity of g, the two successive transformations 
yield the angular average of the value of g in a small neighborhood of 


that point. 
Proof: Let the function gr(x,y) be defined by 


ga(zy) = [f Gfxfy) exp Li2e(fxe + fry)] dfx dfy 
AR 


where A, is a circle of radius R, centered at the origin of the fxfy plane. 
To prove the theorem, it suffices to show that at each point of continuity 
of g, 


os Qr(xt,y) = g(x,y) 


and that at each point of discontinuity of g, 


Ly pea: 
lim gr(x,y) = — f go(0) dé 
Ro Qa 0 


where g.(@) is the angular dependence of g in a small neighborhood about 
the point in question. 
Some initial straightforward manipulation can be performed as 


follows: 
LAS [de dn gC&n) exp [—i2m( xt + fen} 
exp [j2r(fxx + fry)] dfx dfy 


[fae dn glen) [f atx afr exp (i2rlfxle — 8) + fry — li 
—« AR 


I 


gr(x,y) 


Noting that 


J\(2rRr) 
[ffx afr exp {i2alfx@ — 9 + fry — nl) =R | : 
Ar 


where r = V (x — £)2?-+ (y — n)’, we have 


Ink 
gr(x,y) = I dé dn g(Esn)h jae | 
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Suppose initially that (x,y) is a point of continuity of g. Then 
jee | 
(ig 


ff a ay (6) lim B 


= 6 


lim gr(x,y) 


RO 


[fa dn a(en)6@ — & y =n) = g(xy) 


where Eq. (A-2e) has been used in the second step. Thus the first part 
of the theorem has been proved. 

Consider next a point of discontinuity of g. Without loss of generality 
that point can be taken to be the origin. Thus we write 


eet J1(20Rr) 
ax(00) = ff ded glenn [PO] 


where r = V #2 + n°. But for sufficiently large R, the quantity in brackets 
has significant value only in a small neighborhood of the origin. In 
addition, in this small neighborhood the function g depends (approxi- 
mately) only on the angle @ about that point, and therefore 


gr(0,0) = ie g.() dé ie rR ja] dr 


s 
where go(#) represents the @ dependence of g about the origin. Finally, 
noting that 


7 


2 J,(2rR 
[fe [ PS) ae = 2 
0 7 Qa 
we conclude that 
; l 2m 
lim gn(0,0) = 5 |," g.(6) do 


and the proof is thus complete. 
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